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PREFACE 

The past few years have seen a rapid development in the 
mathematical formulation and testing of behavioral science 
theory. The problem has been attacked from many different 
standpoints by persons widely scattered both geographically 
and in field of specialization. In an effort to coordinate some of 
this activity the University of Michigan made a proposal to the 
Ford Foundation to hold an eight week seminar on The Design of 
Experiments in Decision Processes which would have as partici- 
pants a small group of mathematicians, statisticians, psycholo- 
gists, economists, and philosophers. The proposal was approved 
and the seminar was held in the summer of 1952 in Santa Monica, 
California. This location was chosen because of the common in- 
terest of the RAND Corporation in the problems of the seminar, 
and the seminar benefited considerably from participation of 
RAND employees and consultants as well as from the services 
of the RAND business office. 

In addition to the participants who were supported by the Ford 
Grant and those loaned by RAND, several regular participants 
were supported by the Office of Naval Research (through con- 
tracts with the University of Michigan, the University of Minne- 
sota, and Princeton University) and still others by the Cowles 
Commission. There were also frequent visitors from the Uni- 
versity of California at Los Angeles and from others in the Los 
Angeles area. 

The results of the seminar included a great deal of stimula- 
tion of empirical research and of further theory construction. 
It was decided to publish a volume on the proceedings of the 
seminar allowing about a year for the completion of some of the 
research growing out of it. The editors have also included some 
additional papers which are closely related to the purposes of 
the seminar. 

The authors of the volume do not expect that it will be a 
definitive work, but rather regard it as a vehicle for raising a 
number of basic questions and perhaps also providing some 
guideposts towards answers to some of these questions. Ac- 
cordingly, an informal and relatively speedy method of printing 
seemed suitable. Since the method of printing did not permit 
proof reading by the authors, and also in the interests of speed, 
the authors gave permission to the editors to do all of the proof 
reading and to make such minor technical changes (e.g. in foot- 
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notes, references, styles of formulas) as might render the total 
presentation more uniform and understandable. Much that might 
have been standardized has been left intact in the interest of 
presenting each author pretty well in his own manner. Never- 
theless, there have also been many minor changes; consequently 
any error encountered in the final text may well be of the edi- 
tors 1 making. In any event, all such errors in detail are to be 
charged to the editors. 

A list of regular participants and the titles of the papers 
given at the formal sessions of the seminar are given as Ap- 
pendices A and B. 

The title "Decision Processes" does not imply that the 
papers are limited to the branch of statistics bearing that name. 
The individual papers range in character from pure mathe- 
matics to experiments in group dynamics, but all are directed 
at the application of mathematics to behavioral sciences in gen- 
eral and at decision processes in particular. The range and 
interrelationship of these papers are discussed in the introduc- 
tory chapter. It is the hope of the editors that mathematicians 
and behavioral scientists will find some of the papers of interest 
to them and that a few will find them all of interest. 

The editors 1 task would have been impossible without the help 
of many referees who included seminar participants, authors, 
and also the following persons: Raoul Bott, A. H. Cope land, Sr., 
I. S. Copi, D. A. Darling, Ward Edwards, J. R. P. French, 
Lawrence Klein, D. G. Marquis, John McCarthy, T. M. Newcomb, 
G. Y. Rainich and A. W. Tucker. The editors also wish to ex- 
press their appreciation to Wiley and Sons and to the composi- 
tors Edwards Brothers for their fine cooperation in the making 
of this book. 

R. M. Thrall, C. H. Coombs, R. L. Davis 
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CHAPTER I 



INTRODUCTION TO "DECISION PROCESSES" 

by 

Robert L. Davis* 
UNIVERSITY OF MICHIGAN 



For over two thousand years the chief preoccupation with de- 
cision problems concerned whether or not a given action or de- 
cision would be good, right, or otherwise well calculated to 
achieve some desired effect. Philosophers often presented these 
problems in terms of a dilemma. The decision- maker faced a 
known situation and if there were more than two alternative 
courses of action available to him, these usually were at best a 
class of actions which differed essentially in but one component 
(thus permitting him to define and hew to a * mean"). 

Perhaps even more of a restriction than that on alternative 
courses of action is the implicit assumption in these early for- 
mulations that decisions were to be made on the basis of one 
known situation. To be sure, reasonable thinkers must always 
have allowed in some loose qualitative way for information whose 
accuracy was only probable; but the device of making separate 
analyses for each of the relevant potential a states of nature" 
would have been bootless before the invention of the calculus of 
probability. It is the concept of mathematical expectation that 
lends this device its appeal and power. 

Also awaiting analysis was the question of how to say pre- 
cisely what "the desired effect" was to mean; in fact there are 



*The editors delegated to this author the task of writing a 
chapter designed to serve as introduction and to some extent as 
amalgam for the book and the kinds of ideas it deals with. While 
he is responsible for selection and organization of the material 
to accomplish this, as well as for the writing, many of the ideas 
here are the outgrowth of numerous editorial conferences of the 
past year. In particular, the word "we" used in discussing mat- 
ters of opinion refers to all three editors. 
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some who say it is still waiting. But at least today we have 
formally well developed theories of preference and utility; most 
men work within this framework and those who question it seem 
usually to do so concerning the interpretation to be given for the 
abstract system. 

These are the main ingredients of the change that has taken 
place over the past two millenia since rational philosophy first 
attacked decision problems. They were not all put together at 
once. Whether a systematic probability calculus began with 
Fermat and Pascal, as hitherto generally supposed, or actually 
with Cardano, the fact is that the science of statistics has chiefly 
grown up within the present century. Again, Daniel Bernoulli 
used a utility concept more than 200 years ago to analyze the St. 
Petersburg paradox; but whether we look to him or to the line 
of 19th Century economists including Bentham, Gossen, Jevons, 
Edgeworth and culminating in Pareto, we do not find a rigorous 
formulation of utility theory until the past quarter- century. 
Finally, it was only after the late war that the work of von 
Neumann in game theory and of Wald in statistical decision 
theory came to the attention of any wide class of mathematicians, 
philosophers and scientists, and led to a basic reorientation con- 
cerning decision problems. 

Thus what Ave may call the modern look in decision theory is 
somewhat less than ten years old. It is just in the past two years, 
in fact, that we have begun getting books - ranging from technical 
monographs through textbooks and popularizations - explicitly 
dealing with decision processes. Now such of these books as we 
have read are entirely different from this volume in at least two 
respects: (1) they present what is in some sense a "finished" 
part of the theory, and (2) they are written from a unified point 
of view. (In particular, many deal solely with the restricted 
topic known as "statistical decision theory"; while this theory is 
basic to the approach of several of our authors, much of the vol- 
ume lies quite outside its purview). To account for these differ- 
ences requires another brief historical digression. 

It is probably clear that if anyone announces that he has a de- 
cision theory which either "explains" how people actually make 
decisions (in the way any scientific theory "explains* its subject 
matter) or prescribes how people should make decisions, this 
will be of great interest not only to other professed decision 
theorists, but in general to economists, sociologists and psycholo- 
gists, to philosophers, and to mathematicians. The interest of 
the social scientists is clear: someone offers to tell them some- 
thing about their own field. Whether the philosopher is interested 
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because of the proposed removal from his own realm of one more 
body of hitherto purely speculative material, or rather because 
of his interest in methodology and scientific principles, decision 
processes attract serious philosophical attention. Mathemati- 
cians taking to these matters often content themselves with 
pointing out that the source of much interesting new mathematics 
lies in early development of deductive theories designed for ap- 
plication to the real world. 

Since the fall of 1950 a seminar on the application of mathe- 
matics to social science has met regularly at the University of 
Michigan under the sponsorship of the Departments of Economics, 
Mathematics, Philosophy, Psychology, and Sociology. Mathemat- 
ical ideas have sometimes been explored for their possible ap- 
plication in social science, and theoretical systems of the 
social sciences have been studied for their possible formulation 
mathematically. It became apparent in the first two years of its 
operation that much of the material most interesting to the semi- 
nar dealt with decision processes. How this led to the summer 
project at Santa Monica in 1952 is detailed in the preface. The 
important thing here is the variety of the personnel at this sum- 
mer conference, and of their interests. Some were concerned 
with more or less the whole problem, some with aspects of it 
which only the initiate could tell might be related to decision 
processes. It is this variety, together with the fact that none of 
these participants was interested in mere reformulation of stand- 
ard parts of the theory (unless for the purpose of setting up a 
new attack), that accounts for the differences between this vol- 
ume and other books seemingly devoted to the same subject. 

Certain merits and shortcomings of our volume are almost 
inevitable consequences of these facts. It would be unrealistic 
to expect a coherent and unified attack on decision theory from 
such a group at the present stage. With authors of varied back- 
ground and habits of exposition, we get a total output which must 
seem uneven in style, which in dealing with such different prob- 
lems also does so at markedly different depths of penetration, 
and in which the individual authors are not noticeably in agree- 
ment on some of the fundamental questions in the whole theory 
(such as the nature of "utility" and "subjective probability*, for 
instance). On the other hand, this variety is necessary to a 
presentation of any reasonable sample of the current activity in 
decision problems. Thus, although we cannot even claim to pre- 
sent here all of the kinds of approach to decision processes 
which have been developed to date (and no doubt the simplest and 
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most effective ones still await development), we can say that to 
the best of our knowledge this book is representative of the lat- 
est work in our subject in all the disciplines concerned. 

We could start a catalogue of the diversified viewpoints rep- 
resented here by recognizing the distinction between those whose 
interests were primarily formal and theoretical and those with 
a more experimental orientation. It would be incorrect to say 
that more than one or two of our authors are primarily experi- 
mentalists, but several others among the social scientists at 
least matched their interest in theory with a record of experi- 
mental training and experience. The mathematicians and philos- 
ophers, on the other hand, had by profession been previously 
concerned with purely formal theories. The position of the 
economists in this regard may seem anomalous: classically, 
they have worked with purely formal theory, founded on intro- 
spection, although the subject they studied was an empirical one 
of great practical importance in human affairs. 

One result of these differences was the division between 
those most interested in working out the logical consequences of 
a theory or a set of axioms, and those most interested in finding 
out how people acted. There is a related and somewhat subtler 
distinction which, as often as it is recognized and properly de- 
scribed, nevertheless keeps obscuring communication and under- 
standing between various scientists. That is the distinction be- 
tween the theory which is in some sense principally descriptive 
or predictive and that which is solely normative. The classical 
economic approach to the problems of decision, for instance, was 
phrased in terms of the activity of the "rational man". These 
notions were formalized in what is now called statistical deci- 
sion theory, and a great many of our participants in the Santa 
Monica conference were primarily interested in the kinds of 
problems encountered here. But many others - chiefly, perhaps, 
the psychologists and sociologists - considered it a basic flaw 
that this theory had no apparent relation to observed behavior^ 

The words "normative" and "rational man" seem unfortunate. 
It may be that the first economists to think in these terms actu- 
ally intended to set forth what a man should do if he were ra- 
tional; the intention of modern formulators of normative theory 
is usually much more modest. Each constructs an abstract 
system about which he says in effect: "If a person's behavior is 
such that under the proper interpretation it can be said to satisfy 
the requirements of this system, then his behavior is what is 
called 'rational' in my theory." For instance, the theory of zero- 
sum two- per son games can be taken as normative theory in the 
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sense that it gives instructions according to which a man will be 
able to maximize his expected payoff in such a game, assuming 
he can find the solution. But this does not say anyone should 
use this theory in playing an actual game: it may be that he can 
more easily secure this maximum expected payoff in some other 
way, or it may be that he is an iconoclast who sees no sufficient 
connection between mathematical expectation and the outcome of 
any play of the game to justify the procedure. 

Game theory illustrates another source of confusion in inter- 
disciplinary discussions. For von Neumann and Morgenstern 
evidently intended not only a normative interpretation for their 
theory, but also that it provide foundations for a new analysis of 
the empirical processes of economic exchange. Now discussion 
of any theory which like this is intended to have both normative 
and empirical applications must proceed in terms of careful 
discrimination as to what aspect of the theory is concerned. 
Thus criticism of empirical applications of the theory must be 
set apart from strictures on its normative implications. And of 
course both types of argument should be distinguished from any 
attack on the abstract theory. Perhaps because these distinc- 
tions are so obvious, they are not always formulated explicitly 
in expert discussion; even with experts this can lead to argument 
at cross-purposes. 

The breakdown between empirical and formal approaches is 
helpful in our organization of only part of the book: the fourth 
part can be said to consist of experimentally oriented papers, 
although one is by a statistician and another by four mathemati- 
cians. The second part leans a little in this direction too. 

The distinction between the normative and descriptive aspects 
of the theories they present affords even less help in classifying 
the papers, because so many - as in the example of game theory 
are clearly intended to have some interpretation in each sense. 
Chapters in through VI and possible Chapter XII might be sup- 
posed to be most nearly concerned with the normative aspects 
of the topics they discuss. 

Not feeling either of these distinctions to provide a sound 
basis for division of our book into parts, we nevertheless found 
a more or less natural division in terms of subject matter. Even 
this partitioning of the contents is in some degree artificial; we 
will now try to point to some of the interconnections and compari- 
sons between various papers. 

First, as to Chapter II. The genesis of this paper is described 
on its first page. These ideas had been fairly well shaken to- 
gether by the time of the summer meeting of 1952. There they 
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were presented early and may have served somewhat the same 
semi- introductory function they serve here. Even when you 
talk to a very erudite audience you often find it expedient to 
make a brief survey of the foundations of your subject - just to 
establish terminological agreement and start people thinking 
along your lines. This chapter gives a sort of quasi-formal 
status to three terminological practices which were perhaps 
fairly common among social scientists before, but which are 
somewhat different from ordinary scientific and mathematical 
usage. Thus the * mathematical model" - quickly shortened in 
use to "model" - here seems somewhat different from what the 
typical mathematician or logician would expect. 

Any system may be a "model". It may, for instance, in some 
remarkable case, be only a model for itself. In general, the 
same system may provide a model for many other systems in 
being a realization of each either under "interpretation" or 
under "abstraction". Thus, for example, a mathematician might 
construct a system of colored fluids and pipes and valves as a 
model for a particular kind of equation in the abstract mathe- 
matical theory called hydrodynamics; but the same system of 
liquids and pipes - if fortunately conceived - might serve an 
engineer as model for the flow in a river network. In applica- 
tions to social science "model" is usually used for the more 
abstract system, in contradistinction to common mathematical 
usage. Again, the word is so widely used that it has now be- 
come customary to use it even when there is no abstraction or 
interpretation at issue. This has the (rather uneconomical) ef- 
fect of making it practically synonomous with "abstract system 

Thus "mathematical model" here means simply "formal de- 
ductive system"; a mathematical model together with a given 
interpretation of the model in the "real world" constitutes a 
"theory", while "real world" itself is a term intended to denote 
what social scientists mean when they talk about the real world, 
whether that is to be taken as itself another (and probably sim- 
pler, more immediately understandable) linguistic system, or 
simply as a set of specifications for making measurements, or 
as whatever your form of epistemology will buy. 

This chapter's introductory treatment of measurement theory 
in terms of the theory of relations provides a basis for generali- 
zation, as well as a guide to precise statement of problems fall- 
ing within the existing framework, that may serve for a great 
many kinds of discussion in the social sciences. 
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PART ONE 

Two of the five chapters which form the first part of the book 
deal explicitly with the standard statistical decision problem. 
Two others discuss the application of statistical decision methods 
to the problems of group choice, and the last of these chapters 
is chiefly concerned with shades of distinction possibly arising 
in the probability statements so essential in the statistical deci- 
sion analysis. 

Although Leo Goodman's paper "On Methods of Amalgamation" 
is actually concerned with the group choice problem, it serves 
as a good introduction also to the very similar problem of find- 
ing a statistical decision function. Like the papers of Milnor 
and of Radner and Marschak following it, this paper starts with 
the game viewpoint concerning decision problems. By "the 
game viewpoint" we intend to describe the statement of such 
problems in terms of matrices whose entries are the values of 
utility functions: in the decision problem for individuals, the 
entry uy can be taken to be the utility of the outcome consequent 
upon selection of the ith alternative course of action in the event 
the "true state of nature " is the jth of the possible states. As 
Goodman points out this is formally similar to one simple 
formulation of the problem of group choice. He shows how a 
number of special means of obtaining social choice functions 
can be subsumed under one method of amalgamation, and goes 
on to discuss the relations between this method and several 
others. 

Chapters IV and V are alike in many respects. In * Games 
Against Nature", John Milnor looks at three lately proposed 
criteria for decision- making together with a modernized version 
of the standard Laplace criterion. As in so many cases in sta- 
tistics and other young disciplines, such criteria have usually 
been advanced in company with proofs that they possess one or 
several intuitively desirable properties not possessed by com- 
peting criteria, and the argument left at that. But Milnor shows 
that by now we have too much of a good thing: in fact it is im- 
possible for any criterion to possess all of the properties which 
have been advanced as intuitively desirable. This discussion 
brings the fundamental differences among the four criteria into 
sharp focus. Milnor then proposes another and less restrictive 
set of " desirable properties" which he proves can be simultane- 
ously satisfied, though he cannot construct such a criterion which 
is computationally practicable. 
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Radner and Marschak, in their "Note on Some Proposed 
Decision Criteria", discuss certain undesirable properties of 
two of the four methods Milnor considered (actually they use a 
slight generalization of the Hurwicz criterion). In the game 
they construct for this purpose the Hurwicz criterion runs 
counter to common sense in dictating that a strategy be chosen 
on the basis of but one observation where it is obviously reason- 
able to make a great many. They illustrate the * dependence on 
irrelevant alternatives" of Savage's minimax- regret criterion 
by considering the solutions it prescribes in two cases: one where 
the player may use any strategy, and the other where he is re- 
quired to use what they call a * maximum likelihood rule". 
Though he will actually use a maximum likelihood rule in the 
first case also - even when not required to - the minimax- regret 
strategies call for different numbers of observations in these 
two cases. 

In a Social Choice and Strength of Preference", Coombs re- 
turns us to the problem of group choice. Here he applies his 
measurement theory and technique for * ordered metric scales" 
to derive a social utility function which explicitly weights individ- 
uals' strength of preference. Assuming a common unit of meas- 
ure for utility, and with certain other restrictions, he shows 
how to get such a social utility without having to measure indi- 
vidual utilities numerically. The resulting utility may be de- 
scribed as giving the "least disliked" alternative as the social 
choice. He compares his procedure with several others com- 
mon in psychological measurement and also illustrates it with 
an experiment. 

Finally, Vail's paper on "Alternative Calculi of Subjective 
Probabilities" leaves formal structures aside to consider ques- 
tions concerning the variables that figure in formulations of all 
these problems. Not all of these questions, as he points out, 
have anything to do with uncertainty; it is clearly desirable to 
dispense with those which do not before taking up, as he does in 
some detail, the various interpretations which might be given to 
the word a probability ". (It is perhaps not entirely clear that the 
so-called "objective probability" is quite as simple as his treat- 
ment indicates.) In addition to mathematical probability and 
"objective" probability we may distinguish at least two kinds of 
psychological or subjective probability: the individual's guess 
as to what number best approximates his concept of objective 
probability for an event, and his "degree of belief". These 
need not be the same. Another valuable distinction concerns the 
nature of the event in question. Is it in every relevant aspect 
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unique? Or is it a member of a finite - or of an infinite - class 
of somehow similar events? Vail classifies choice problems 
according to such distinctions and discusses empirical implica- 
tions of this classification. 



PART TWO 



Perhaps some readers will be surprised to see that learning 
theory plays a sufficient role in our subject to merit a separate 
part of this book. First we should make it clear to non- 
psychological readers that within that discipline the topic ordi- 
narily identified as "learning theory " has far broader scope than 
might be guessed from merely reading these three chapters. 
Much of what psychologists mean by learning theory is concerned 
with analysis, or at least classification, of what are regarded as 
determinants of human behavior in learning situations; our three 
chapters give this feature of the subject only incidental mention. 
But a certain class of learning experiments happens to be based 
on one of the simplest imaginable decision situations. And for 
handling this kind of data our authors have developed methods 
which are at once part of the modern treatment of decision proc- 
esses and also offshoots of the traditional learning theory of 
psychology. 

The Bush-Mosteller linear operator model in learning theory 
has by now been extensively developed and applied to many learn- 
ing experiments. In Chapter VIII Bush and Mosteller have col- 
laborated with the mathematician G. L. Thompson to give an 
extensive discussion of the formal structure of their model and 
some of its mathematical consequences. Psychologists will be 
aware from their own literature of the many applications this 
model has found, while others may feel this introduction to the 
subject more immediately illuminating for its purely formal 
character. 

On the surface, their formulation of the problem confronting 
the individual in terms of alternatives, outcomes, and the 
alternative- outcome pairs called events is similar to that of 
statistical decision theory. A first difference occurs in that the 
probabilities associated with these events may depend on the 
alternative chosen. And from here on they move in a quite dif- 
ferent direction. The fundamental distinction lies in the 
sequential nature of the learning experiment. Thus the individu- 
al's tendencies at trial n to choose the various alternatives are 
given by the components of a probability vector p(n); the authors 
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seek to describe the way this vector changes from trial to trial. 
They analyze these changes on the basis of three assumptions: 
(i) that the change from one trial to the next is given in operator 
form: Tj^pCn) = p(n+l), where the stochastic operator Tj k de- 
pends solely on the event Ej^ which occurred on trial n(" Markov 
hypothesis*); (ii) that each T jk is linear, and (iii) that the Tj k 

satisfy their (rather restrictive) "Combination of Classes" 
requirement. 

Independently of Bush and Mos teller, W. K. Estes had devel- 
oped a stochastic theory of learning which in many respects 
resembled theirs. His Chapter DC deals with a part of his work 
which strikes close to the problems of decision theory; it hap- 
pens that the mathematical formulation of this special case in 
his theory can also be obtained from the Bush- Mos teller model. 
(However, this paper represents only one particular application 
of the Estes theory which is in general more concerned with the 
determinants of the learning changes than the Bush-Mosteller.) 
Estes reports here one experiment, which in addition to illus- 
trating this feature of his theory is also interesting for having 
led to a great deal of discussion at Santa Monica and to con- 
siderable further experiment since then. (Some of this discus- 
sion appears in Chapter XVIII; see also our remarks on that 
chapter.) 

*On Game- Learning Theory and Some Decision- Making 
Experiments,* by M. M. Flood, studies games in which a player 
learns to improve his strategy during the course of a sequence 
of plays. In such games the situation can be regarded as "static* 
if the opponent uses a constant mixed strategy throughout the 
sequence of plays; allowing him to vary his strategies creates 
a more general "dynamic" problem. Even though first examina- 
tion of game- learning involves only the u static model", Flood 
finds it necessary to combine game theory with the Bush- Mosteller 
learning theory in his analysis. Within this context he discusses 
the results of Monte Carlo computations and some pilot experi- 
ments with human subjects. 



PART THREE 



Perhaps the most controversial theme in our whole subject 
was that of utility. Here again it is helpful to distinguish the dif- 
ferent kinds of objections that can be raised. Some of these are 
that: (1) the utility scheme does not describe the actual behavior 
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of individuals at all faithfully; (2) the assumption that people 
even act as if they were maximizing utility cannot be sustained 
by empirical observation; (3) utility cannot even provide a 
sound basis for normative theory because, for instance, indi- 
viduals could not find any consistent empirical interpretation for 
utility statements, or (4) the logical formulation of abstract 
utility theory is inconsistent. (We believe the von Neumann- 
Morgenstern axioms constitute a counterexample to the last 
statement.) 

Utility has always been a vexing issue. A considerable por- 
tion of the economic and philosophical disputation of the past 
century dealt with various aspects of this concept. To avoid 
confusion with these purely historical matters, we will mention 
several questions concerning the older concepts of utility which 
are not at issue in current debate. First, there is no connota- 
tion in the kind of utility discussed in this volume of any "utili- 
tarian* or welfare character. On the other hand, neither do our 
authors insist on any particular hedonistic psychological inter- 
pretation for their utilities (parts of Bohnert's paper only seem 
to be exceptional here). Finally, philosophers have frequently 
contested the validity of utility statements on the grounds that 
what was intended was not an " extensive magnitude", or some 
such argument; they often implied that in order to be used it 
had to be measured and that in order to be measured it had to be 
measured on something at least as strong as a ratio scale (see 
Chapter II). The von Neumann- Morgenstern theory and its ap- 
plication in game theory provide a counterexample here too. It 
turns out that utility so defined is to be measured on the weaker 
interval scale, and in the papers by Hausner and Thrall we see 
that even this can be somewhat weakened without loss of the 
practical applications. 

Discussions of utility characteristically begin with a preference 
ordering on a set of objects, prospects of events. Abstractly, the 
starting point is just a chain order defined on some set. Now in 
the face of uncertainty, preference alone is not sufficient to de- 
termine "rational" grounds for choosing one course of action 
over another; at the simplest level, for instance, most people 
would prefer $110 to $100, but would prefer an outright gift of 
$100 to a 50-50 chance of winning $110. Considerations of this 
kind lead in the von Neumann- Morgenstern approach to assuming 
initially that the given set of prospects is "closed under" proba- 
bility combination, i.e., that it is substantially what Hauser and 
Thrall call a mixture space. Now a utility function is classically 
a certain kind of real- valued function so defined on the set of 
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prospects as to preserve the preference order. Starting - for 
such practical reasons as suggested above - with a mixture 
space, it is straightforward to get the utility function: just ex- 
tend the preference ordering to the whole mixture space in a 
way that fits with the probability combinations. 

The path taken by Gerard Debreu in the first chapter of Part 
Three is more direct. For game theory and statistical purposes 
one wants the probability combinations, but they are unneces- 
sary in many economic applications. Debreu asks when we can 
reasonably expect to be able to define a real-valued function on 
an arbitrary chain ordered set in such a way as to preserve the 
order. He then shows that if the given set is any separable 
connected topological space in which the topology fits with the 
given order in a certain natural way, then the (preference) 
ordering can be extended to a utility function. He further shows 
that the assumption of connectedness can be given up by addi- 
tionally assuming perfect separability (second countability). 
The latter statement gives the theorem for any subset of 
Euclidean space in which the relative topology matches the 
given ordering in the prescribed way. 

An objection to their utility axioms which von Neumann and 
Morgenstern mentioned lies in the "Archimedean* nature of the 
utility: which means, in this case, that if A, B and C are any 
elements of the mixture space ranked in that order under the 
preference relation, there is a probability p such that B is 
equivalent in preference to the combination consisting of "A 
with probability p or C with probability 1 - p". This overlooks 
the possibility of any sum mum bonum or summum malum, and 
Thrall gives examples in which it might be reasonable to reject 
this Archimedean property. 

The multidimensional utility theory that answers these ob- 
jections was developed in several steps, all taking place at the 
RAND Corporation. In the fall of 1951 Thrall and Norman 
Dalkey worked out this theory for the finite-dimensional case. 
Melvin Hausner and J. G. Wendel followed shortly with a gen- 
eralization to infinite dimension which also presented the Thrall- 
Dalkey results in simpler, better unified form. Then in the 1952 
project Thrall and Dalkey continued this development with a 
study of possible applications of this more general utility concept. 

In * Multidimensional Utilities 19 , Hausner attacks the abstract 
problem of constructing and characterizing utility in the non- 
Archimedean case by splitting the problem into two parts. First 
he considers mixture spaces, and shows that any one can be 
imbedded in a vector space; then he introduces order axioms to 
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make the mixture space into a (no n- Archimedean) utility space, 
which he shows can be imbedded in an "ordered vector space*. 
Finally he characterizes these ordered vector spaces. 

In the next chapter, Thrall undertakes a discussion of the 
utility axioms both from the mathematical point of view and 
from that of empirical interpretation. He constructs examples 
to show the abstract desirability of several of the axioms and 
possible objections to others. Finally he indicates that the non- 
Archimedean utilities can still be used to get the results of 
game theory; in particular, they permit computation of solutions. 

In Chapter XIV Jacob Marschak carries out an extensive 
analysis of the activities of "teams" on the basis of utility theory, 
game theory and statistical decision procedures. This analysis, 
as he points out, is purely normative: how should a team act to 
maximize its gain? A "team" here is essentially a group for 
which there is a known group utility function which coincides 
with the common utility function of all the members. By weaken- 
ing his requirements on solidarity of interest he defines first a 
* foundation" and then a "coalition". The activities of the team 
members include making observations, performing actions, 
sending messages and giving orders. He analyzes a number of 
cases to determine the best rules of action and communication 
(a "rule" states for each member of the team what response he 
should make to every possible situation). The relatively diffi- 
cult manipulations required even for these simple cases show 
for one thing how desirable further development and simplifica- 
tion of the theory would be, while on the other hand they serve to 
emphasize how difficult would be any analysis at all without the 
machinery of this formalization. 

In the last chapter of this part we have a dissenting note. 
Herbert Bohnert brings together some of the objections that are 
still offered to any suggested interpretation of the utility concept. 
While many of the former objections are banished in the modest 
assertions of von Neumann- Morgenstern utility theory, and 
while this theory is certainly unexceptionable from an abstract 
viewpoint, Bohnert questions - among other things - whether we 
can ever say what kind of a thing "E" is in the sentence *The 
utility of E for x at time t equals u utiles". ("Utile" is a popular 
word for a "unit" on the utility scale established by any given 
utility function; of course, utility is only invariant up to change 
of origin and scale-factor.) Certainly several of Bohnert' s ob- 
jections seem serious in considering any empirical interpreta- 
tion of utility; especially so since there is a tendency in parlor 
discussions to answer difficult points by saying "That has to be 
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included in the utility function". This tendency can lead to a 
utility function which must, eventually, be defined at any given 
time for all possible future states of the universe. Whether 
such objections have the same force in many fairly straight- 
forward applications discussed by our authors is another 
matter. 

In any event we are hardly in a position to dispense with the 
utility concept. Leaving aside its many special applications in 
economics, we have suggested above that it is a natural con- 
comitant of the analysis of any choice situation involving un- 
certainty. There utility was ushered in with the preference 
ordering of a given set of probability combinations. This situa- 
tion arises in any choice problem under uncertainty. In game 
theory and in the general statistical decision problem there is a 
set of outcomes on which we assume a preference ordering, and 
as before the element of uncertainty introduces utilities. For 
any presently conceivable attack on these problems requires the 
great simplification of summarizing preference in the face of 
risk with one quantity - an expected value. It is the utility con- 
cept that justifies this simplification. (With the assumptions of 
utility theory the two-person zero-sum game is ready for solu- 
tion - since we assume opposing utilities for the two players. 
In statistics we have a game against nature, but we cannot gen- 
erally assume utilities for this opponent or that nature is "hos- 
tile". Expected loss is thus not yet determined and this leads 
to the problems concerning decision criteria discussed in Chap- 
ters IV and V.) 

Himself recognizing the value of the utility concept, Bohnert 
concludes his paper with a brief suggestion for a new kind of 
definition of utility; this rests on Carnap's concept of logical 
probability and makes use of the notions of modal logic. It 
avoids the feature he found most objectionable in other defini- 
tions - that the domain of the utility function was undefined or 
undefinable - because this domain is now a set of propositions 
in a language system (we are not certain, though, whether his 
replacement of Carnap's "sentence" by * proposition" is entirely 
harmless in this respect). But the language systems to which 
Carnap has succeeded in extending his notion of logical proba- 
bility are rather limited; and it is our impression that very few 
logicians - let alone statisticians or scientists - are yet con- 
vinced of the eventual practicability of Carnap's program. Since 
Bohnert' s definition schema is not sufficiently detailed for us to 
be certain of its nature even within a Carnap language, it may 
seem more remote from application than those he criticizes. 



INTRODUCTION 15 

PART FOUR 

There is really no sharp line in our book between "experi- 
mental" papers and theoretical ones. Chapter DC and possibly 
Chapter X might be considered primarily experimental, on the 
one hand, while on the other it might be argued that Chapter XVII 
was primarily theoretical and belonged either in Part One or 
Part Three, or that Chapter XVIII should have followed theEstes 
paper in Part Two. Nevertheless, we found it most natural to 
place these last two with other more purely experimental papers. 

Hoffman, Festinger and Lawrence chose for examination from 
the standpoint of general psychology one of the crucial problems 
in the theory of n- per son games. The von Neumann- Morgenstern 
theory requires that players decide on coalitions strictly on the 
desire to maximize their returns. That is, in rational play of 
the three- per son game, for instance, each player is willing to 
enter into either of the available two- man coalitions, and he de- 
cides between these solely on the basis of his expected gain 
from membership. Thus in the a simple majority game 9 some 
pair of the three players will form a coalition and win the 
stakes; but game theory cannot predict in this case which pair 
will do so - nor how they will divide the profits afterward. 

These authors felt that the consequences of this analysis 
were contrary to everyday experience, particularly in the ap- 
parent instability of the coalitions in the simple majority game. 
To them it seemed likely that other factors besides monetary or 
" point 1 * advantages would in this case be decisive in determining 
coalitions, and that the coalitions would in fact become predict- 
able and relatively stable. In particular they conjectured that 
one of these factors would depend on the individual's concern 
over his relative status among the players, and that a second 
would be the importance he assigned to the task of succeeding 
in the game. They support these contentions with the descrip- 
tion of a carefully designed and executed experiment. 

It would be misleading, of course, to conclude from this that 
the game-theory analysis of the simple majority game is 
wrong; the games these authors describe are very different 
games, with players' utility systems quite different from the 
symmetric ones of the simple majority game. What might be 
maintained, however, is that the kind of utility consideration 
they here introduce is at once among those which are very hard 
to treat in game theory, and yet of a kind which may naturally be 
expected to obtain in most observations of actual behavior. They 
have given an example of a gamelike situation which may be bet- 
ter treated in the context of general psychology than in the theory 
of games. 
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"On Decision- Making Under Uncertainty*, by Coombs and 
Beardslee, presents at first a very general theory which is at 
least in part designed both as an empirical descriptive frame- 
work and as a normative theory. After laying down their gen- 
eral definitions the authors give an intuitive analysis of a num- 
ber of important features of their theory on the basis of certain 
restrictive assumptions. Here they enlist the reader's geomet- 
rical intuition by an imbedding in Euclidean 3- space. This per- 
mits easy visualization of the relations they discuss between 
various experiments, as well as of the designs for new experi- 
ments suggested by the theory. In these terms they compare, 
for instance, the assumptions underlying the well known 
Mbsteller-Nogee and Preston- Bar rata experiments. Thus the 
former gave a measure of utility on the assumption of a linear 
relation between psychological and objective probabilities, while 
the latter measured psychological probability assuming linearity 
between utility and money. Coombs and Beardslee further re- 
port a pilot experiment carried out in Santa Monica which indi- 
cated that with increasing "stake" the subject tended to prefer 
those offers involving greatest certainty, even though they were 
of reduced utility. 

In his second contribution to our volume Flood examines the 
behavior of subjects in "prediction experiments* of the Estes 
type (Chapter DC). Flood sought an experiment to test the hy- 
pothesis that subject behavior was "mixed" or "pure" according 
as the subject believed the events being predicted followed a 
pattern or were randomly determined. Here, "pure" behavior 
is characterized by the subject's choice at some point in the se- 
quence of trials of a single best strategy (to which he thereafter 
adheres), and "mixed" behavior occurs when the subject gives 
no evidence of any intention to settle on one pure strategy. A 
stationary stochastic process is one whose probability distribu- 
tions do not change with time, and in general "non-stationarity" 
is merely the denial of this changelessness. In these experi- 
ments it happens to be reasonable to suppose that "non- 
stationarity" usually involves the subject's assuming that the 
events being predicted are determined according to some 
(eventually guessable) pattern. Flood reports in this paper the 
results of two pilot experiments which may lend support to his 
(emminently reasonable) hypothesis. (The editors feel that his 
discussion of the remarks of "various game theorists in the 
audience" may be misleading. It is our recollection that in his 
initial report Estes left an impression with the audience that his 
subjects had reason to believe the process was random, and we 
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further recall that no one failed to preface remarks about 
"rationality" or * proper strategies* with a more or less explicit 
assumption about payoff utilities, such as "If they wanted to 
maximize their scores... ".) 

Our final paper furnishes evidence of the educational compass 
of our summer in Santa Monica. Here four pure mathematicians 
report on a series of experiments which they themselves devised, 
carried out and analyzed. To be sure, these experiments lay 
close to the author s' professional interests; they dealt with 
questions concerning the empirical interpretation of some fun- 
damental concepts in the theory of n-person games. 

The n- person game is analyzed by von Neumann and 
Morgenstern in the framework of coalitions and side-payments 
within these coalitions. (This analysis leaves the theory inap- 
plicable from the very start to most many- person parlor games 
such as bridge and hearts.) Nothing is said about how coalitions 
are formed, except that the resulting coalition must be among 
those prescribed by * solutions*. Whatever the process of their 
formation, it seems to admit of no realistic interpretation in 
typical economic situations, since for large n communication 
difficulties would prevent consideration of some (possibly 
optimal) coalitions. 

The payoffs to the players are given as components of vectors. 
A a solution set" is a set of these vectors with certain (necessarily 
rather weak) properties expressed in terms of " dominance*. The 
important thing is that there are in all known non-trivial cases 
far too many * solutions"; you cannot even tell from the solution 
set what coalitions should be formed. And if you knew this, you 
still could not tell how members of the winning coalition would 
split the swag among themselves. Briefly, there are seemingly 
always too many solutions (though it cannot be proved that there 
always is even one), and these solutions are very "un-solutionlike 

Such objections to the original approach of von Neumann and 
Morgenstern led to formulation of various new concepts in the 
n-person theory. Thus Nash's analysis of non-cooperative 
games and his definition of equilibrium points there do away 
with pre-game coalitions and side-payments. Again, the 
Shapley value is a function which permits a prospective player 
to assign a sort of potential value to each of the "roles" or 
"positions" in a game. And Milnor gave upper bounds for the 
amounts each player or set of players "ought" to be able to re- 
ceive from a game. 

Now whether these or similar concepts are actually realized 
in empirical plays of n-person (cooperative or non-cooperative) 
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games is a question that must be answered before we can dis- 
cuss the merits of game theory as a mathematical model for 
the economic theory of exchange. For the expected generaliza- 
tion from games of strategy to economic problems will proceed 
by means of the theory of general n-person games. This is one 
side to the desirability of experiment in n-person games. The 
other is that observing how people actually play games may well 
suggest new concepts deserving of formalization in the theory. 
These four authors discuss a number of experiments, principally 
in light of the question: How may such experiments be sharpened 
for either one of these two purposes? 

Some readers might appreciate comment on the varying 
degrees of mathematical difficulty of our papers. The only 
papers, we feel, that require any real mathematical sophistica- 
tion for understanding the proofs are those of Milnor (IV) and 
Hausner (XII). On the other hand, papers which contain suffi- 
cient mathematical detail to demand fairly close reading include: 
Radner and Marschak (V), Bush-Mosteller-Thompson (VIII), 
Flood (X), Debreu (XI) and Marschak (XIV). Finally, although 
the experiments and results of the four- authored paper on ex- 
perimental n-person games are either well explained or self- 
explanatory, a fair knowledge of game theory may be prereq- 
uisite to a complete grasp of the purposes and possible 
significance of the experiments. 
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We shall undertake first to review the role of mathematical 
models in a science and then briefly discuss the models used in 
classical measurement theory. This will be followed by a gen- 
eralization of measurement models. Illustrations will be intro- 
duced when needed to clarify the concepts discussed. 



I. THE ROLE OF MATHEMATICAL MODELS 

We shall use the terms "physical objects, " M real world, " and 
"object system" synonymously to signify that which the empirical 
scientist seeks to study, including such objects as opinions or 
psychological reactions. The scope and content of a domain is 
selected by the scientist with the intent of discovering laws which 
govern it or making predictions about it, or controlling or at 
least influencing it. 

There are potentially at least as many ways of dividing up the 
world into object systems as there are scientists to undertake 
the task. Just as there is this potential variety of object systems, 



*This paper is an outgrowth of a number of sessions on measure- 
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so also is there a potential variety of mathematical systems. 
Let us describe the nature of a mathematical system. (For a 
more detailed discussion of the nature of mathematical systems, 
see [8], [12], and [13].) A mathematical system consists of a 
set of assertions from which consequences are derived by mathe- 
matical (logical) argument. The assertions are referred to as 
the axioms or postulates of a mathematical system. They always 
contain one or more primitive terms which are undefined and 
have no meaning in the mathematical system. The axioms of the 
mathematical system will usually consist of statements about the 
existence of a set of elements, relations on the elements, prop- 
erties of the relations, operations on the elements, and the 
properties of the operations. Particular mathematical systems 
differ in the particular postulates which form their bases. It is 
evident then that the variety of mathematical systems is limited 
only by the ability of man to construct them. 

Our view of the role that mathematical models play in a sci- 
ence is illustrated in Figure 1. With some segment of the real 
world as his starting point, the scientist, by means of a process 
we shall call abstraction (A), maps his object system into one of 
the mathematical systems or models. By mathematical argu- 
ment (M) certain mathematical conclusions are arrived at as 
necessary (logical) consequences of the postulates of the system. 
The mathematical conclusions are then converted into physical 
conclusions by a process we shall call interpretation (I). 

Let us start with a specific real world situation (RW)j and by 
process A map it into a mathematical system (MS)j. We can 
look at (MS)j as a model of (RW)j. Looked at in reverse, we 
can start with consideration of (MS)j and then (RW)j can be 
viewed as a model of (MS)j and the process of going from (MS)j 
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Fig. 1. The Symmetrical Roles of Experiment and Mathematics 
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to (RW)j we call "realization." Thus, "realization" is the con- 
verse of "abstraction." Now, given (MS)j we might be able to 
find a real world situation, (RW^, such that by assigning mean- 
ings to the undefined terms of the mathematical system the as- 
sertions about "sets of elements," "relations," and "operations" 
in (MS)} become identified with objects or concepts about (RW) 2 . 
That is, (RW) 2 may be another model of (MS)j and the process 
of going from (RW)j to (MS)j to (RW) 2 often indicates subtle 
analogies between systems such as (RW)j and (RW) 2 . To the 
mathematician who often starts with an abstract system the 
model is a concrete analogue of the abstract system. To the 
social scientist who starts with phenomena in the real world the 
model is the analogue in the abstract system. 

In establishing a model for a given object system one of the 
most difficult tasks is to attempt a division of the phenomenon 
into two parts; namely that part which we abstract (A) into the 
basic assumptions or axioms of the abstract system and that 
part which we relegate to the physical conclusions, and which 
we reserve as a check against the interpretations from the ab- 
stract system. In a given object system there is no unique par- 
tition of the phenomena and which partition is made depends on 
the creative imagination of the model builder. Indeed, there are 
models in the physical and biological sciences for which there 
are no experimentally verified or verifiable correlates in the 
real world for the undefined terms, relations, and operations in 
the abstract model. A similar situation prevails on the abstract 
system side, namely, it is often possible in a given abstract sys- 
tem to interchange the roles of certain axioms and theorems. 
Thus, in a given system there is no unique method of splitting 
the mathematical propositions into axioms and theorems. In 
going from the abstract system to the object system we have the 
parallel processes of realization and interpretation. It is quite 
common to consider these synonymous; however, we prefer in 
this discussion to reserve the word "interpretation" for the 
process which maps the mathematical conclusions (rather than 
the axioms) into the object system. 

Let us summarize briefly up to this point. Beginning with a 
segment of the real world, the scientist, by an entirely theoreti- 
cal route, has arrived at certain conclusions about the real world. 
His first step is a process of abstraction from the real world, 
then a process of logical argument to an abstract conclusion, then 
a return to the real world by a process of interpretation yielding 
conclusions with physical meaning. But there is an alternative 
route to physical conclusions and this is by way of working with 
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the object system itself. Thus, the scientist may begin with the 
real world segment in which he is interested and proceed di- 
rectly to physical conclusions by a process of observation or 
experiment (T). 

The path (T) (experimentation) from the real world to the 
physical conclusions needs further scrutiny. Usually in theory 
construction the scientist embarks on model building after he 
has many facts at his disposal. These facts he partitions into 
two parts one part serves as a springboard for the abstraction 
process (A); the other part serves as a check on the model by 
making comparisons with these initial facts and the interpreta- 
tions (I) stemming from the model. If a specific interpretation 
is not at variance with a fact in the initial reservoir, but at the 
same time not corroborated by our a priori notions of the ob- 
ject system, then the model perhaps" has contributed to our 
knowledge of the object system. The scientist next tests this 
tentative conclusion by setting up a plan of experimental veri- 
fication, if this is possible. Often direct verification may not 
be possible, and corroboration stems from examination of ex- 
perimental evidence which supports claims of the model quite 
indirectly. That is, motivated by interpretations of the model, 
the scientist sets up an experimental design, obtains observa- 
tions by experimentation, makes a statistical interpretation of 
these observations into physical conclusions, and compares the 
conclusions with those of the abstract route in order to appraise 
the model. As suggested by a referee (Frederick Mosteller) of 
this paper, it would be appropriate to generalize Figure 1 as 
shown in Figure 2. The route A2EI2 in Figure 2 is summarized 
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Fig. 2. A Generalization of Figure 1 
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by the route T in Figure 1. If the physical conclusions of the 
process A^MIj are at variance with the a priori facts or with 
conclusions arrived at via A 2 El2 (and if more confidence is 
placed in the experimental route than in the theoretical route) 
then the suitability of the model is suspect. 

The task of a science looked at in this way may be seen to be 
the task of trying to arrive at the same conclusions about the 
real world by two different routes: one is by experiment and the 
other by logical argument; these correspond, respectively, to 
the left and right sides of Figures 1 and 2. There is no natural 
or necessary order in which these routes should be followed. 
The history of science is replete with instances in which physi- 
cal experiments suggested axiom systems to the mathematicians 
and, thereby, contributed to the development of mathematics. On 
the other hand, mathematical systems developed under such 
stimulation, in turn, suggested experiments. And there have 
been many instances of mathematical systems developed without 
reference to any known reality which subsequently filled a need 
of theoretical scientists. The direction that mathematics has 
taken is in considerable part due to its interaction with the physi- 
cal sciences and the problems arising therein. 

It is illuminating here to observe the way in which the models 
of the mathematical theory of probability and statistics fit this 
picture. As in any abstract system the mathematical theory of 
probability is devoid of any real- world content; and as in any 
other mathematical system the axioms of probability specify 
interrelationships among undefined terms. It is common to let 
the notion of probability itself be undefined and to attempt to 
capture in the axiomatic structure properties of probability 
motivated by the interpretations we have in mind (e.g., gambling 
games, physical diffusions processes, etc.). Given an associa- 
tion of probabilities to prescribed elementary sets, the axioms 
of probability dictate how one must associate probabilities with 
other sets. How we make these preliminary associations, pro- 
viding we have consistency, is not relevant to the purely abstract 
system. When we come to apply the probability model we are 
confronted with the problem of identifying real events with ab- 
stract sets in the mathematical system and the measurement 
problem of associating probabilities to these abstract sets. Ex- 
perience has taught us that if we exploit the notion of the relative 
frequency of occurrence of real events when making our pre- 
liminary associations then the interpretations from the model 
have a similar frequency interpretation in the real world. To 
be sure, our rules of composition in the formal system were 
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devised with this in mind. We associate probabilities in one way 
rather than in another way in the process (A) so that when we 
generate AMI, our interpretations are in "close" accord with 
results of experimentation, T, when T is possible. When T is 
not possible we have to rely to a great extent on analogy. 

An extremely important problem of statistics can be viewed 
as follows: For a priori reasons we may have a well-defined 
family of possible probability associations. Each element of this 
family, when used in the abstraction process (A), generates by 
AMI a probability measure having a frequency interpretation 
over real events. In addition, we are given a set of possible ac- 
tions to be taken. Preferences for these actions depend in some 
way on the relative "appropriateness" of different probability 
associations in the abstraction process, A. By conducting an 
experiment, T, and noting its outcome we gain some insight into 
the relative "appropriateness" of the different probability asso- 
ciations and thus base our action accordingly. Variations of this 
problem, which involves the entire AMI-T process, have been 
abstracted sufficiently so that models of mathematical statistics 
include counterparts of all these ingredients within the mathe- 
matical system itself. 

In a given model we may be confronted with the problem of 
deciding whether the AMI argument gives results "close enough" 
to the experimental results from T. We often can view this prob- 
lem involving a complete AMI-T process as the real world phe- 
nomenon to which we apply the A process sending it into a formal 
mathematical statistics system. The statistics system analyzes 
step M and our interpretation I takes the form of an acceptance 
or rejection statement concerning the original theory. 

The process of measurement, corresponding to A in Figure 1, 
provides an excellent illustration of the role of mathematical 
models. There are many types of observations that can be called 
"measurement." Perhaps the most obvious are those made with 
yardsticks, thermometers, and other instruments, which result 
immediately in the assignment of a real number to the object 
being measured. In other cases, such as the number of correct 
items on a mental test or the size of a herd of cattle, the result 
of measurement is a natural number (positive integer). In still 
other cases, such as relative ability of two chess players, rela- 
tive desirability of a pair of pictures, or relative hardness of two 
substances, the result is a dominance (or preference) relation. 
We might even stretch the concept of measurement to include 
such processes as naming each element of some class of objects, 
or the photographic representation of some event, or the cate- 
gorization of mental illnesses or occupations. 
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The process of measurement may be described formally as 
follows. Let P = {PI, P2i } denote a set of physical objects 
or events. By a measurement A on P we mean a function which 
assigns to each element p of P an element b = A(p) in some 
mathematical system B = {bj, }. That is, to each element 
of P, we associate an element of some abstract system B (the 
process A of Figure 1). The system B consists of a set of ele- 
ments with some mathematical structure imposed on its ele- 
ments. The nature of the set P and the actual mapping into the 
abstract space B comprises the operation of measurement. The 
mathematical structure of the system B belong* to the formal 
side of measurement theory. The structure of B is dictated by 
a set of rules or axioms which states relationships between the 
elements of B. However, no connotation can be given to these 
elements of B which is not explicitly stated in the axioms, i.e., 
their labels are extraneous with respect to considerations of the 
structure of B. 

After making the mapping from P into B, then one may op- 
erate with the image elements in B (always abiding by the 
axioms, process M of Figure 1). Purely mathematical results 
obtained in B must then be interpreted back in the real world 
(the process I of Figure 1), to enable one to make predictions 
or to synthesize data concerning set P. 

If the manifestations of P (as a result of the process T of 
Figure 1) are in conflict with the results of process I obtained 
from B, then one must search for a new cycle AMI. Suppose 
that we have a family of abstractions { A# } from the given situa- 
tion P, and suppose that M# , !# complete the cycle begun with 
Aa- Among all of the available cycles AgMglg we seek one, 
say A o M o Io> which is "closest" to T according to some cri- 
terion. Some models have a criterion built in to judge closeness 
and others of a more deterministic nature require an exact fit. 

The process T represents the experimental or operational 
part of model building, the process M represents the formal or 
logical aspect. The processes A and I are really the keys to 
the model and serve as bridges between experiment and, formal 
reasoning. 

It might be well here to draw clearly the distinction between 
a model and a theory. A model is not itself a theory; it is only 
an available or possible or potential theory until a segment of 
the real world has been mapped into it. Then the model becomes 
a theory about the real world. As a theory, it can be accepted or 
rejected on the basis of how well it works. As a model, it can 
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only be right or wrong on logical grounds. A model must satisfy 
only internal criteria; a theory must satisfy external criteria as 
well. 

An example of the distinction between models and theories 
lies in the domain of measurement. A measurement scale, such 
as an ordinal, interval, or ratio scale is a model and needs only 
to be internally consistent. As soon as behavior or data are 
ft measured n by being mapped into one of these scales, then the 
model becomes a theory about that data and may be right or 
wrong. Scales of measurement are only a very small portion of 
the many formal systems in mathematics which might serve as 
image spaces or models, but will be discussed here as they con- 
stitute very simple and immediate examples of the role of 
mathematical models. First to be discussed will be the models 
of conventional measurement theory and then a generalization of 
these models will be presented. 



II. MATHEMATICAL MODELS OF CLASSICAL 
MEASUREMENT THEORY 

The first comprehensive classification of the mathematical 
models used in conventional measurement theory was made by 
Stevens [9]. He classified scales of measurement into nominal, 
ordinal, interval, and ratio scales, the latter two christened by 
him. A more complete discussion of these scales is contained 
in a later work by him [10], and also in Coombs [3], [5] and 
Weitzenhoffer [11]. Because of the literature available on these 
scales and because they constitute a restricted class of models 
they will be briefly summarized here only to provide a basis for 
generalization in the next section. 

The mathematical model of measurement is said to be nomi- 
nal if it merely contributes a mapping AQ of P into M o without 
any further structure on M o . A nominal scale M may be sub- 
jected to any 1-1 transformation without gain or loss in infor- 
mation. 

An ordinal scale of measurement is implied if there is a natu- 
ral ranking of the objects of measurement according to some at- 
tribute. More precisely, the ordinal scale is appropriate if the 
objects of measurement can be partitioned into classes in such a 
manner that: a) elements which belong to the same class can be 
considered equivalent relative to the attribute in question; b) a 
comparative judgment or an order relation can be made between 
each pair of distinct classes (for example, class x is more 
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than class y); c) there is an element of consistency in these 

comparative judgments namely, if class x is more than 

class y and class y is more than class z, then class x is 

more than class z (that is, the comparative judgment or 

order relation is transitive). For example, the familiar socio- 
economic classes, upper-upper, lower-upper, upper-middle, 
lower-middle, upper-lower, and lower-lower, imply the meas- 
urement of socio-economic status on an ordinal scale. The num- 
bers 1, 2, 3, 4, 5, 6, or 1, 5, 10, 11, 12, 14, or the letters 
A, B, C, D, E, F, could designate the six classes without gain 
or loss of information. 

The measurement is said to be an interval scale when the set 
M consists of the real numbers and any linear transformation, 
y = ax + b (a ^ 0), on M is permissible. Measurement on an 
interval scale is achieved with a constant unit of measurement 
and an arbitrary zero. An example of an interval scale is the 
measure of time. That is, "physical events 11 can be mapped into 
the real numbers and all the operations of arithmetic are per- 
missible on the differences between all pairs of these numbers. 

If the set M consists of the real numbers subject only to the 
transformation group y = ex where c is any non-zero scalar, 
the scale is called a ratio scale. Measurement on a ratio scale 
is achieved with an absolute zero and a constant unit of measure- 
ment. The scalar c signifies that only the unit of measurement 
is arbitrary. In a ratio scale all the operations of arithmetic are 
permissible. The most familiar examples of ratio scales are 
observed in physics in such measurements as length, weight, and 
absolute temperature. 



HI. A GENERALIZATION OF MEASUREMENT MODELS 

An axiomatic basis for certain scales of measurement will be 
presented in this section. Other scales can be generated by 
forming mixtures (or composites) of these. Indeed, some of the 
scales listed in the diagram shown in Figure 3 can be regarded 
as composites of others. 

We will now list defining axioms for each of these systems 
and briefly discuss their roles. It is not claimed that this list 
is exhaustive; it is presented to illustrate certain possibilities 
for significant generalizations of scales used in the classical 
theory. The arrangement in the diagram is from top to bottom 
in order of increasing strength of axioms; a connecting line in- 
dicates that the lower listed system is a special case of the 
higher one. 
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BQ, The Nominal Scale. A nominal scale, B Q , may be con- 
sidered a mathematical system consisting merely of a set of 
elements. We define the index of B o to be the number of ele- 
ments in B . (The index may be finite or infinite.) 

Examples of segments of the real world that are mapped into 
nominal scales are psychiatric classifications, job families, and 
disease types. 

The nominal scale, B o , is the most primitive step in any sys- 
tem of measurement. The set of elements is partitioned into 
classes such that there is a relation of "equality" or equivalence 
between pairs of elements from the same class. The nominal 
scale is fundamental since the process of discrimination is a 
necessary prerequisite for any more complex form of measure- 
ment. 

BI, The Relation Scale. Perhaps the smallest step that may 
be taken to strengthen this mathematical system is to introduce 
a relation between some pairs of elements. In technical language, 
a relation R on a set Bj is a set of ordered pairs (b, b ! ) of ele- 
ments of BI. We write bRb' to indicate that (b, b ! ) is one of the 
pairs included in the relation R, and call the set BI a relation 
scale. It is important to recognize that for R to constitute a 
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very useful relation, not all possible pairs (b, b f ) from B\ can 
be included in the relation R. 

With some risk of misinterpretation or distortion, these con- 
cepts might be illustrated as follows. Consider a set of persons 
identified by a nominal scale, B o . Let us now define the relation 
R on B to be "loves." Thus R consists of the ordered pairs 
(a, b) for which, a loves b. 

The particular relation used here as an illustration is one 
whose mathematical properties are mostly negative. We cannot 
conclude from a loves b and b loves c, that a loves c, or 
that b loves a, or that b does not love a. For example, if John 
loves Mary and if Mary loves Peter, it may well be that, far 
from loving him, John would like to see Peter transported to the 
South Pole. In the terminology to be introduced below, we would 
say that love is not symmetric, is not asymmetric, and is not 
transitive. 

B2, The Antisymmetric Relation Scale. A relation R on a set 
B is said to be antisymmetric if aRb and bRa together imply 
that a is identical with b. An example is the relation for real 
numbers. A statement such as, "Picture a is at least as good 
as picture b" illustrates an antisymmetric relation on a collec- 
tion of pictures, provided that there are not in the collection two 
distinct pictures of equal merit, i.e., two pictures about which 
the judge is indifferent. 

Closely connected to the concept of antisymmetry is that of 
asymmetry. A relation R on a set B is said to be asymmetric 
if aRb implies bR'a (where bR'a means that b is not in the 
relation R to a). The mathematical prototype of asymmetry is 
the relation > for real numbers. Verbal forms for asymmetric 
relations include such statements as, "Picture a is better than 
picture b," or "Player a beats player b in a game." 

Antisymmetry and asymmetry are seen to be at the root of 
statements of comparison. These two classes of relations can 
be regarded as the most primitive types of order relations. At 
the opposite pole from these concepts is that of symmetry. A 
relation R is said to be symmetric if aRb implies bRa. For 
example, the relations "is a sibling of," "is a cousin of," and 
"is the same color as" are all symmetric. 

If S is an asymmetric relation on a set B, we can obtain 
from it an antisymmetric relation R by the definition: aRb 
means either aSb or a = b. Conversely, if R is antisymmetric 
and we define aSb to mean aRb and a ^ b, then S is asym- 
metric. It is customary to use the symbols, ^ , ^ for anti- 
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symmetric relations and to use < , > for the associated asym- 
metric relations. 

B The Transitive Relation Scale. A relation R is said to 



be transitive if aRb and bRc imply aRc. In the physical world, 
preference judgments which are not transitive are frequently re- 
garded as inconsistent or irrational. However, situations such 
as that of three chess players, each of whom can beat one of the 
other two, show that transitivity is not a requirement of nature. 

The chess player relation is antisymmetric but not transitive. 
An example of a relation that is symmetric and transitive is 
given by a communication system where each link is bidirection- 
al; here aRb is given the meaning "there exists a chain of links 
starting with a and ending with b." If the links are not required 
to be bidirectional, the relation is still transitive but is no longer 
symmetric. Note that in this example it is quite possible to have 
aRa, i.e., a chain beginning at a and ending at a. (This chain 
must have at least one element different from a.) 

The relation "a is the rival of b" (say as suitors of a par- 
ticular girl) is symmetric and is almost transitive. If aRb and 
bRc, we can conclude aRc unless a = c; but we can hardly re- 
gard a as being his own rival. This type of relation arises fre- 
quently in studies of social structures. We say a relation R is 
quasi-transitive if aRb, bRc, and a ^ c imply aRc. The sibling 
relation is also quasi-transitive. Of course, if R is quasi- 
transitive we can define a new relation S to be the same as R 
except that also aRb, bRa imply aSa. In some instances S is 
just as good a model as R, but in others the extension from R 
to S destroys the usefulness of the model. 

As an example consider the structure matrix A = Hayll of 
some society. Thus we set ay = 1 if person i has direct influ- 
ence on person j and set ay = otherwise. One must decide in 
accordance with the purpose of the investigation whether or not 
to set the diagonal element an equal to or to 1. (The relation 
"i has direct influence on j" is not transitive even if we take 
each an = 1, but this example nevertheless illustrates the kind 
of problem involved in the contrast between transitivity and 
quasi-transitivity.) 

If the relation aRb meant a "is higher in socio-economic 
status than n b and this required that a had more income and 
more education than b, then the relation R would be asymmetric 
and transitive. 
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Eg, The Partly Ordered Scale, A relation which is re- 
flexive, antisymmetric, and transitive is called a partial order. 
If for some pair a, b, neither of the relations a b, b a holds, 
we say that a and b are incomparable relative to . In the case 
of a preference relation, incomparability is not the same thing 
as indifference. We call a set a poset (partly ordered set), if 
there is a partial order relation defined on B. 

If a b, we also write b 1 a; if a b and a ^ b, we write 
a > b or b < a. 

A partial order may be illustrated as follows. Suppose that 
on a mental test no two individuals in a group pass exactly the 
same items. Now let a b symbolize the relation n a passed 
all the items b did. M Then a >b means that "a passed all the 
items b did and at least one more." This poset reflects multi- 
dimensionality of the attributes mediating the test performance 
and some interesting mathematical problems arise regarding the 
partial order as a "product" of simple orders. The result is a 
non-metric form of factor analysis with some of the same prob- 
lems as factor analysis [3]. 

Next we consider the mental test example modified so as to 
allow the possibility that two individuals a and b pass exactly 
the same items. Then in the above notation we have a ^ b and 
b a, but not b = a. Hence, no longer gives a partial order. 
However, if we define a I b to mean a ^ b and b a, it is not 
hard to show that if we identify individuals with the same test 
performance then ^ is a partial order relation. Or, alterna- 
tively, we could consider as a partial order relation on the 
set of possible test performances. It is customary to make such 
identifications and speak of a partial order as though it were ac- 
tually on the initial set rather than on the identified classes or 
on the test results. 

Another example of a partial order is implicit in the treat- 
ment of the comparative efficiency of mental tests on a "cost- 
utility" basis [1]. "Cost" is the fraction of potentially success- 
ful people who are eliminated by a test; "utility" is the fraction 
of potential failures who are eliminated by the test. If for their 
respective cutting scores one test has a higher utility and a 
lower cost than another it is a superior test, but if it had a higher 
utility and a higher cost the two tests would be incomparable un- 
less the relative weight of excluding a potential success to includ- 
ing a potential failure were known. 

A basic problem in the theory of testing hypotheses in statis- 
tical inference is to test a simple hypothesis, H o ("null hypothe- 
sis"), against a single alternative hypothesis, HI, by means of 
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experimental data. A test, T, associates to each experimental 
outcome the decision to accept H o or to accept Hj (but not 
both!). Each test T is appraised by a pair of numbers, namely, 
the probability of accepting HI if H o is true, PX(HI(H O ), and 
probability of accepting H o if HJ is true, PT( H O! H I^ Given 
two tests, T f and T", then T' is said to be as good as T fl 
(T 1 T") if and only if 



i PT(HI|H O ) 

p T'< H ol H l> $ P T "< H ll H o> 

The relation on the set of all tests is an example of a partial 
order. 

64, Lattice. Let B be a poset relative to a relation . If 
a, b, c are elements of B and c a, c b, we say that c is an 
upper bound of a and b. If also c ^ x for every upper bound x 
of a and b, we say that c is the least upper bound of a and b 
and write c = a U b. In terms of the example of mental testing, 
c could be a person who passed exactly those items which were 
passed by at least one of a and b. Analogously, if d ^ a, d ^ b, 
we say that d is a lower bound of a and b, and if also d ^ y 
for all lower bounds y of a and b, we say that d is the greatest 
lower bound of a and b and write d = a n b. In our example d 
could be a person who passed exactly those items passed by both 
a and b. 

A pair a,b need not have a least upper bound, nor a greatest 
lower bound. For example, if a passed items 1, 2, 3, 4; b 
passed 1, 2, 5, 6; c passed 1, 2, 3, 4, 5, 6, 7; c' passed 1, 2, 3, 
4, 5, 6, 8; and there are no other persons, then both c and c 1 
are upper bounds to a and b but there is no least upper bound. 
Also in this case there are no lower bounds for a and b and 
hence no greatest lower bound. 

A poset is said to be a lattice if, for every pair a,b, both 
a u b and a O b exist. The lattice is an intermediate model 
between a partial order and a vector space. 

George Miller (personal communication) has recently investi- 
gated the use of a lattice theoretic treatment of information in 
experimental psychology. To each item of information he asso- 
ciates (process A) an element of a lattice. If two items of in- 
formation are associated respectively with elements x and y 
of the lattice then: 1) the item which consists of the information 
common to the original items is associated with the element 
x n y; 2) the item which consists of the information contained 
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in either of the original items is associated with the element 
x u y. As used by Miller an item of information might consist 
of a cue or a sequence of cues in an experimental situation. The 
common procedure is to summarize the structure of the experi- 
ment by means of a lattice, given an experimental setup. How- 
ever, the abstract lattice in turn can motivate new types of ex- 
perimental situations and indicate analogies between experimen- 
tal designs which otherwise would not be apparent. 

64', Weak Order. A transitive order ^ is defined on 641, 
and has the property that for every pair a,b either a S b or 
b ^ a. If both a ^ b and b ^ a, we say that a and b are indif- 
ferent. Indifference is an equivalence relation (i.e., is reflexive, 
symmetric, and transitive). 

A weak ordering would be illustrated by the military ranks of 
second lieutenant, first lieutenant, captain, major, etc. Each of 
these would constitute an equivalence class and for any two offi- 
cers (a,b), either a b or b ^ a, or both. 

85, Chain. A poset in which every pair is comparable is 
called a chain (or simple order, or linear order, or complete 
order). Alternatively, a chain is a weak order in which each 
indifference class consists of a single element. Here every 
pair of elements is ordered. 

The previous example of a weak ordering of military rank 
could be converted into a chain if date of rank, standing in class, 
etc., were taken into account. Then, for every two distinct ele- 
ments, a,b, either a > b or b > a. 

The ordinal scales of classical measurement theory are ex- 
amples of chains. 

BS*, Partly Ordered Vector Space. A special case of lattice 
is provided by a real vector space (or a subset of a vector space). 
A vector x = (xj, , x n ) is an ordered set of n real numbers 
called the components of the vector. We define x ^ y to mean 
that xi ^ yi for each component. (Here the second symbol ^ 
refers to the usual ordering of real numbers.) This definition 
makes the vector space into a poset, and this poset is a lattice 
which is called a partly ordered vector space. 

A partly ordered vector space is illustrated by the compara- 
bility of individuals in mental abilities. Conceiving of intelligence 
as made up of a number of primary mental abilities, each of these 
constitutes a component or dimension. Then, it may be said of 
two individuals x and y that y is at least as intelligent as x if 
and only if y has as much or more of each component as x does. 
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The term "vector" is sometimes used in a more general 
situation. If Cj, , C n are chain orders we may consider 
vectors or n-tuples c = (cj, , c n ) where the i-th component 
ci lies in the chain order Cj (i = 1, , n). The set C of all 
such vectors c is called the Cartesian product of Cj, , C n 
and is denoted by C = Cj x x C n . We can make C into a 
poset by a process analogous to that used above for real vector 
spaces. Note that a real vector space is the special case of a 
Cartesian product of n factors Cj, , C n each equal to the 
set of real numbers. 

Be> Simply Ordered Vector Space, or Utility Space. A real 
vector space (or subset) in which x < y is defined lexicographi- 
cally, i.e., x < y If xj = y lf , Xi_i = yi.i, xj < yj is a spe- 
cial case of simple order. 

A lexicographic ordering can be illustrated by a manner in 
which we might expect a fortune hunter to impose a chain order 
on a set of unmarried women. Presumably, financial assets 
would be the principal component and he might construct a weak 
ordering of the ladies into, say five classes, on this basis. Then 
he would turn to the second component, say beauty, and within 
each of the financial classes construct a simple ordering of the 
ladies on this component. Any two ladies (a,b) would then be 
simply ordered as follows: 

1) If a were in a higher financial class than b, a would 
be preferred to b. 

2) If a were in the same financial class as b, then prefer- 
ence would be determined by their relation on the beauty 
component. 

67, Real Numbers. The transition to scales using the real 
numbers has been given additional importance by the development 
of von Neumann-Morgenstern utilities. Even though one may 
wish to arrive here in order to have a simple index when a de- 
cision is to be made, it may frequently be desirable not to get 
here all at once, but to keep the components at a weaker level 
until it is necessary to map into the real numbers. 

Measurement scales involving the real numbers, the interval 
scale and the ratio scale, have been discussed at length in the 
literature [3], [5], [9], [10], [11] and will not be pursued again 
here. 
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IV, FURTHER EXTENSIONS 

The various mathematical systems discussed here as avail- 
able for measurement have been illustrated with objects of the 
real world mapped into the elements of an abstract system. A 
further level of abstraction is provided by defining a "distance 
function" in the abstract system, in which ordered pairs of ele- 
ments in the abstract system are mapped into elements of an- 
other abstract system about which a variety of assertions may 
be made. In the context of measurement, these pairs of elements 
may correspond to "differences 11 between pairs of objects in the 
real world. These differences may themselves then be mapped 
into an appropriate abstract system such as one of those dis- 
cussed here. 

A number of these types of scales have been discussed by one 
of the authors [4, Chapter l]. An illustration is the ordered 
metric scale in which the objects themselves satisfy 85, a 
simply ordered scale, and ordered pairs of objects, regarded 
as "distances" between them, satisfy 83, a partly ordered 
scale. Such scales are now being utilized for the measurement 
of utility and psychological probability in experiments on deci- 
sion making under uncertainty [6], [7]. 



V. SUMMARY 

One role of mathematical models is to provide a logical route 
to go from characteristics of the real world to predictions about 
it. The alternative route is by observation or experiment on the 
real world itself. The view expressed here is that these two 
routes are coordinate. 

The various scales used in measurement serve as an illustra- 
tion of the application of mathematical models and are subject to 
the same constraints as other mathematical models. That is, if 
the axioms underlying the scale are not satisfied by that segment 
of the real world which is mapped into it, then the interpretations 
of the mathematical conclusions may have no reality or meaning. 
Thus, to insist that measurement always constitutes the mapping 
of physical objects into the real number system is to impose on 
the real world an abstract theory which may be invalid. 

A partial ordering of various alternative mathematical sys- 
tems available for measurement has been presented with illus- 
trations in order to reveal the relative strengths of these scales 
to which the real world must conform to permit their application. 
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We make no claim to completeness in this list of models for 
measurement theory. Our purpose is to point out the richness 
of the set of possible models and to give some examples that 
show how the use of more general models can extend the domain 
of classical measurement theory. 

None of the discussion here should be taken as an argument 
for the use of weaker scales in the place of stronger scales for 
their own sake. The measurement scale utilized constitutes a 
theory about the real world and the stronger the theory the 
better, so long as it is correct. The addition of axioms to a 
scale which are not satisfied by the real world is a step away 
from the path of progress. 
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1. INTRODUCTION AND SUMMARY 

The problem which shall be the concern of this paper is 
closely related to the problem of social choice and individual 
values [l], the theory of statistical decisions [26], the theory of 
games against nature [21], the problem of normative behavior 
under uncertainty [20], and the problem of cooperative aggrega- 
tion [12], Let us first state the problem formally. We are given 
a finite matrix U 



U 



U 12 U 13 U 14 ' 
U 21 U 22 U 23 U 24 ' ' 
U 31 U 32 U 33 U 34 ' ' 

U42 U43 U44 



' U 2c 
* U 3c 



u r i u r2 



of utilities ujj which measure the utility of row i for column j fl 



*Part of this research was carried out while the author was a 
member of the University of Michigan Summer Seminar on De- 
sign of Experiments on Decision Processes. A part of this work 
was prepared under an Office of Naval Research contract. Some 
of the results were presented at a joint meeting of the Econo- 
metric Society and the Institute of Mathematical Statistics at 
Michigan State College, East Lansing, Michigan, on September 2, 
1952. The author is indebted to Harry Markowitz of the Rand 
Corporation for many helpful suggestions. 
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The problem is to devise rules (methods of amalgamation) for 
choosing the row with the f 'greatest utility. 11 

In games against nature [21], the rows represent possible 
strategies among which the player must choose and the columns 
represent possible states of nature. The payoff to the player 
when he chooses strategy (row) i is given by the entry uy if 
nature is in state (column) j. Here the entry uy represents a 
von Neumann-Morgenstern utility. The problem is to determine 
criteria or devise rules which the player might use for choos- 
ing the strategy (row) which will give him the "greatest utility/ 1 

In the problem of social choice and individual values [l], the 
rows represent possible social choices among which the "best 11 
choice is to be made and the columns represent the individuals 1 
values. That is, the value which individual j ascribes to social 
choice i is given by entry Ujj. The entries may represent the 
utility of social choice i for individual j, but will usually only 
describe the rank order assigned to social choice i by individual 
j (e g., aij = 1 if individual j ranks social choice i as his least 
preferred social choice, ay = 2 if individual j ranks social 
choice i as next to the least preferred social choice, . . .). The 
problem is then to amalgamate the values of the different in- 
dividuals (columns) in order to make the social choice which is 
"most preferred" by the group of individuals. 

In the theory of statistical decisions or the theory of norma- 
tive behavior under uncertainty, the rows represent courses of 
action and the columns represent the possible states of the world 
(see, e.g. [23]). The expected income resulting from act i is uy 
if the world is in state j. The problem is to choose the "best" 
course of action. 

In the problem of cooperative aggregation the rows may 
represent different alternatives and the columns represent dif- 
ferent bases of comparison, e.g., price, efficiency, and looks in 
comparing automobiles. Then the problem is to amalgamate the 
different bases of comparison in order to arrive at a method for 
deciding which is the "best" alternative. 

These problems have been studied by economicts, psycholo- 
gists, sociologists, statisticians, mathematicians, and philoso- 
phers. Since the formal aspects of these problems are so closely 
related to each other, one would expect that some of the results 
which have been obtained by, say, the statisticians would be re- 
lated to results obtained by those studying the problem of social 
choice and individual values (and vice versa). Furthermore, one 
would expect that some of the results by, say, the statisticians 
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would have implications for the problem of social choice (and 
vice versa). Some of these relations and implications are 
studied in this paper. A general method of amalgamation is 
presented which includes as special cases (a) the Laplace cri- 
terion for the problem of statistical decision, (b) the method of 
majority rule when this rule leads to a social choice, (c) the 
"reasonable" social welfare function of A. H. Copeland, and 
(d) the Bayes solution to the statistical problem. This general 
method of amalgamation may also be used in order to develop 
still other criteria which are special cases. 



2. SOME METHODS OF AMALGAMATION 

c 
Suppose that ujj is considered as a measure of the 

utility of row i in the case where the utilities are real numbers. 
Then the best row would be the one for which the average of the 
entries is greatest^ This method of amalgamation has been 
called the Laplace criterion. Articles 690, 719, 806, 990 of [24] 
discuss the history of this criterion. (I am indebted to Churchill 
Eisenhart and Dick Savage of the Bureau of Standards for their 
historical research on this problem which brought this reference 
to my attention.) 

If relevant probabilities associated with the columns may be 
computed (i.e., if a number PJ > may be assigned to column j 

for j = 1, 2, , c), then another method of amalgamation, 

c 
called the Bayes solution [3], may be used. Then E ujjpj may 

be considered as a measure of the average utility of row i and 
the best row would be the one for which the average utility is 
greatest. The Laplace criterion is obtained if PJ is the same 
for all j. 

The method of majority rule for the problem of social choice 
is another criterion for amalgamation which may be described 
as follows. The entries in the matrix U are taken to be utilities, 
and with respect to any pair u, v of utilities it is assumed that 
one of the following three decisions can be made: (1) u is pre- 
ferred to v, (2) v is preferred to u, (3) neither is preferred to 
the other. With each pair u, v we associate a number (u,v) 
which equals +1, -1, respectively according as (1), (2), or 
(3) holds. Then row i will win a majority of the votes in an 
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A c A 

election with row k, if 0(i,k) = S s(uij,Ufcj) is positive. Row i 

is considered best if it will win a majority of the votes in an 
election with every other row. That is, that row i is chosen 



for which #(i,k) is positive for all k ^ i. A row with this 
property may not exist and in that case no "best" row is ob- 
tained. 

A. H. Copeland [ll] has suggested another method of amal- 
gamation which he calls a "reasonable" social welfare function. 
This method is a modification of the method of majority rule 
which permits the rows to be ordered. Copeland's social wel- 
fare function might be paraphrased in the following manner. 
Let sgn x be the signum function of x: 

!+l for x>0, 
for x = 0, 
-1 for x<0. 

/\ r A 

We consider 0(1) = S sgn <p(i,k) as a measure of the utility 

k=l 

of row i, and choose that row for which 0(i) is greatest. 

A general method of amalgamation will now be presented 
which will help to make clear the relations between the preced- 
ing methods and which will suggest still other methods of amal- 
gamation. This general method will have as special cases all 
of the preceding methods including the method of majority rule 
when that rule leads to the choice a "best" row: With each pair 
u, v we associate a real number s(u,v) which is positive when u 
is preferred to v, negative when v is preferred to u, and zero 
when neither is preferred to the other. Let t(x) be a non- 
decreasing function of x and let PJ ^ be any number assigned 

to column j. Then row i is better than row k by an amount 

c 
0(i,k) if 0(i,k) ss s(uij,uj c j)pj is positive. As a measure of 

the utility of row i, we take 

r 

t[0(i,k)] 
k=l 

and choose that row with the greatest utility. 

The choice of PJ depends on whether relevant probabilities 
(weights) may be assigned to the columns. In the theory of sta- 
tistical decisions, these PJ will be the a priori probabilities 
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associated with the various states (columns) of nature. In the 
problem of social choice, the PJ might be the weight assigned to 
the values of individual j or the influence which individual j has 
in arriving at a best social choice. The function s(u,v) meas- 
ures how much importance is to be given to the preference be- 
tween the two utilities u and v. The function t(x) measures 
how much importance is to be given to the amount x that row i 
is better than another row in determining the utility of row i. 
For example, in the case where the utilities are real numbers 
and s(u,v) = sgn(u-v), then only the fact that one utility is great- 
er than another utility has significance; and the amount by which 
it is greater is considered irrelevant. Also, if t(x) = sgn x then 
only the fact that row i is better than row k has significance; 
and the amount by which it is better is considered irrelevant. 

In the special case where t(x) = x, the utility of row i may 
be written as 

*(i,k) = E[E 

k=l j=lLk=l 

r 
We might consider Z s(ujj,U( c j) = 0j(i) as a measure of the 

K JL 

utility of row i for column j. Hence, the utility of row i may 

c 
be written as E <J(I)PJ, a weighted sum of the utility of row i 

for the various columns. 

For the Bayes solution, it is assumed that the utilities are 
real numbers, s(u,v) = u - v, and t(x) = x; and the Laplace cri- 
terion is obtained if further the PJ is a constant a. For the 
method of majority rule p j = a and s(u,v) is restricted to the 
values 4-1, -1, 0. The Copeland solution is obtained if further 
t(x) - sgn x. That is, in the method of majority rule and in the 
Copeland solution, the strength of individual j's preference 
s(ujj,u k j) is considered irrelevant; only the fact of whether or 
not ne prefers row i to row k is of significance. 

Suppose we now consider the following modification of the 
Copeland solution where t(x) is taken equal to x rather than 
sgn x. Then as a measure of the utility of row i, we have 

r re c r 

E <*>(i,k) = E E s < u ij> u kj) a E E s(uij,u kj ) . 

k 1 If 1 i 1 i1 lr 1 

iv- 1 It 1 Jl J Ivl 
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Now the rank order a^j assigned to row i by person j is, in 
fact 

aij = i Z s(uij,u k j) + r + 1 . 

Hence, the measure of the utility of row i is 

c 
2 L 1 - c(r+l) . 



Therefore, this modification of the Copeland solution may be 
stated as follows: Replace the entries in the matrix by the rank 
order assigned by each column, and then apply the Laplace solu- 
tion to the ranks. By modifying the values of the pj in the Cope- 
land solution, the rule that the Bayes solution should be applied 
to the ranks is obtained. 

Still another method of amalgamation which is not included 
among those already described herein is the minimax (maximin) 
principle. This principle states that in the case where the utili- 
ties are real numbers we should choose that row where the 
smallest entry in the row will be as large as possible; that is, 
maximize the minimum entry. This principle is sometimes ap- 
plied to a matrix (e.g. of losses) which may be obtained from 
the original matrix u of utilities. When probability mixtures of 
rows (mixed acts) are also considered, the best row (or the 
best probability mixture of rows) will be the solution of the 
matrix considered as the matrix of a two person zero sum game. 

Modifications of the minimax principle have been suggested 
by Hurwicz [19] and by Savage [23]. (See also Chapter IV.) 



3. SOME RELATIONS AND IMPLICATIONS 

For the statistical problem, it has been pointed out (e.g. [23]) 
that knowledge of the values of the entries u^j in the matrix U 
implies economic knowledge not often available to the working 
statistician. There are often problems of a computational na- 
ture which make it difficult to determine the values of the en- 
tries. For some problems, which the statistician might face, 
it may not be possible to determine the values of the entries u^j, 
but he may be able to determine a rank order for the rows i for 
any given state of nature j. In this case, the statistician's prob- 
lem is very similar to the problem of social choice where indi- 
vidual values are expressed as preferences or as rankings 
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For some problems of social choice, the strength of individ- 
ual preferences may also be obtained. In that case, the entries 
uy will describe the strength of the individual j f s preference 
for social choice i. The problem of social choice is then very 
similar to games against nature and the statistician's problem 

It is interesting to note that a result obtained by Blackwell 
and Girschick [8] for the problem of statistical decisions is re- 
lated to the work of Markowitz and Goodman [15,16] on the 
problem of social choice and individual values. Blackwell and 
Girschick [8] show that if there is (1) a complete order of all 
the possible rows (the ujj range over a continuum), (2) Pareto 
optimality, and (3) independence of origin (indifference of na- 
ture) which requires that the complete ordering remain in- 
variant under addition of a constant to the utilities for a given 

state of nature, then the ordering is representable by a linear 

c 
function .S ujj PJ where u^j is the utility of rpw i when nature 

is in state j and PJ ^ is a set of weights which are inter- 
pretable as a priori probabilities. In [2], Arrow points out 
that conditions (1), (2), and (3) had been suggested by Marko- 
witz and Goodman in [15] as possibly reasonable conditions to 
be imposed upon a social welfare function (ordering of social 
choices). In [15] and [16] the weights PJ are interpretable as 
the amount of influence which individual j has in making the 
social choice. 

An interesting relation exists between the work of Clyde 
Coombs [10] and Duncan Black [5]. In [10] Coombs suggests 
three different methods for obtaining social utilities. We shall 
be concerned only with the first two methods. The first method 
gives "equal weight to each individual and weights each prefer- 
ence by its strength. ft He finds that the rank order assigned by 
the median individual will be the rank order of the social 
choices. Another way of saying this would be as follows: Con- 
sider the scale on which the stimuli (rows) fall. Now place 
each individual (column) on this scale at the point which he 
prefers the most. Then the median individual according to this 
ordering is the one who determines the social choice. Coombs 
considered a second method where each individual had equal 
weight and each preferential judgment (vote) was weighted 
equally. Coombs found empirically for the experiments he had 
performed that these methods gave quite similar results. 
The second method considered by Coombs is similar, though 
not identical with the method of majority rule. The following 
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theorem due to Black [5] determines the best social choice 
when the method of majority rule is used: For single peaked 
preferences (the Coombs model in [10] satisfies this condition), 
the majority choice may be obtained by looking only at the first 
choice of the median individual when all of the individuals are 
considered arrayed according to the underlying scale. Hence 
this theorem shows that the first choice obtained by majority 
rule will be identical with the first choice obtained by Coombs 1 
first method. 

Coombs has shown that the entire rank order of the median 
individual will be the rank order obtained by the social utility 
based on strength of preference if the conditions of his model in 
[10] are satisfied. The theorem by Black [5] determines the 
first choice for the method of majority rule, but it does not dis- 
cuss the entire rank order. (The second choice for the method of 
majority rule is that choice which wins a majority over all other 
choices except the first choice. The third choice for the method 
of majority rule is that choice which wins over all other choices 
except the first and second choices, et cetera.) It seems inter- 
esting to note that Black* s theorem cannot be extended to the 
entire rank order. That is, it is not generally true that the 
rank order of the median individual will be the rank order ob- 
tained by the method of majority rule when the condition of 
single peakedness is satisfied. (See [5].) A numerical example 
may be given to illustrate this point. However, it is true that if 
the conditions of Coombs' model in [10] are satisfied, then the 
rank order of the median individual will be the rank order ob- 
tained by the method of majority rule; this is due to the fact that 
for Coombs* model the majority will prefer j to k if and only if 
the median individual prefers j to k. For suppose that i is to 
the left of k on the J scale (i.e., Qhij < Qjufc), then if the 
median individual is nearer j than k, it follows that all the in- 
dividuals to the left of the median individual will be nearer j, 
and hence the majority will prefer j. Also if the majority is 
nearer j than k, it follows that the median individual will be 
nearer j than k. It has been shown that the rank order ob- 
tained by majority rule is determined by the median individual 
in Coombs 1 model, as was the social utility based on strength of 
preferences. Hence, the method of majority rule and the social 
utility based on strength of preferences both lead to the same 
rank order in Coombs 1 model. 
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CHAPTER IV 

GAMES AGAINST NATURE* 

by 

John Milnor 
PRINCETON UNIVERSITY 

1. INTRODUCTION 

The object of this paper will be to study games of the follow- 
ing type. A matrix (ajj) is given in which a player must choose 
a row. A column will be chosen by " Nature", a fictitious player 
having no known objective and no known strategy. The payoff to 
the player will then be given by the entry in that particular row 
and column. This entry should represent a numerical utility in 
the sense of von Neumann and Morgenstern. (See [3] or [l].) 

It will be shown that several known criteria for playing such 
games can be characterized by simple axioms. An axiomatic 
procedure will also be used to criticise these criteria, and to 
study the possibilities for other criteria. 

(Our basic assumption that the player has absolutely no in- 
formation about Nature may seem too restrictive. However 
such no- information games may be used as a normal form for 
a wider class of games in which certain types of partial infor- 
mation are allowed. For example if the information consists of 
bounds for the probabilities of the various states of Nature, then 
by considering only those mixed strategies for Nature which 
satisfy these bounds, we construct a new game having no infor- 
mation. Unfortunately in practice partial information often oc- 
curs in vague, non-mathematical forms which are difficult to 
handle.) 

The following criteria have been suggested for such games 
against Nature. 

Laplace. If the probabilities of the different possible states 
of Nature are unknown, we should assume that they are all equal. 



The preparation of this paper was sponsored in part by the 
RAND Corporation. The author was a National Science Founda- 
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Thus if the player chooses the i-th row his expectation is given 
by the average (au+- +ai n )/n, and he should choose a row for 
which this average is maximized. 

Wald [4] (Minimax principle). If the player chooses the i-th 
row then his payoff will certainly be at least Min aij. The safest 

possible course of action is therefore to choose a row for which 
Min a^j is maximized. This corresponds to the pessimistic hy- 

pothesis of expecting the worst. 

If mixed strategies for the player are also allowed, then this 
criterion should be formulated as follows. Choose a probability 
mixture (1, *,m) of the rows so that the quantity 



Min (i&ij+* +m a mj) is maximized. In other words play 
as if Nature were the opposing player in a zero sum game. 

Hurwicz*. Select a constant 6 a 1 which measures the 
player's optimism. For each row [or probability mixture of 
rows] let a denote the smallest component and A the largest. 
Choose a row [or probability mixture of rows] for which a A + 
(l-a)a is maximized. For a = this reduces to the Wald cri- 
terion. 

Savage [2] (Minimax Regret). Define the (negative) regret 
matrix (rij) by TIJ = a^ - Max a^j. Thus r^j measures the 

K 

difference between the payoff which actually is obtained and the 
payoff which could have been obtained if the true state of Nature 
had been known. Now apply the Wald criterion to the matrix 
(rip. That is choose a row [or mixture of rows] for which 
Min r^j [or Min (^irjj-f +m r mj)l is maximized. 

These four criteria are certainly different. This is illus- 
trated by the following example, where the preferred row under 
each criterion is indicated. 



2 2 1 \ Laplace 
1111 \Wald 

0400 /Hurwicz (for a > 1/4) 
Savage 



0400 ll 
1 3 O/ 
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2. AXIOMATIC CHARACTERIZATION OF CRITERIA 

In this section we will consider criteria which assign to each 
matrix (ajj) a preference relation > between pairs of rows^ 
of the matrix. It will be shown that each of the four criteria of 
1 is characterized by certain of the following axioms. The first 
five axioms are compatible with all four criteria. 

1. Ordering. The relation > is a complete ordering of the 
rows. That is it is a transitive relation, such that for any two 
rows r,r f either r > r f or r f > r. 

2. Symmetry. This ordering is independent of the numbering 
of the rows and columns. 

(Thus we are not considering situations where there is any 
reason to expect one state of Nature more than another.) 

3. Strong domination. If each component of r is greater than 
the corresponding component of r f , then r > r f (shorthand for: 
r >V but not r 1 r). 

(k) 

4. Continuity. If the matrices a^ ' converge to a^, and if 

(k) x (k) 
r ^ r\ ' for each k, then the limit rows r and rj satisfy 

r> rj. 

5. Linearity. The ordering relation is not changed if the 
matrix (aij) is replaced by (a}j) where a}j = Aatj + M, X > 0. 

The following four axioms serve to distinguish between the 
four criteria. 

6. Row adjunction. The ordering between the old rows is not 
changed by the adjunction of a new row. 

7. Column linearity. The ordering is not changed if a con- 
stant is added to a column. 

(This can be interpreted as an assertion that Nature has no 
prejudices for or against the player. It also asserts that the 
utility is linear, not only with respect to known probabilities, 
but also with respect to unknown probabilities of the type under 
consideration.) 
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8. Column duplication. The ordering is not changed if a new 
column, identical with some old column, is adjoined to the 
matrix. (Thus we are only interested in what states of Nature 
are possible, and not in how often each state may have been 
counted in the formation of the matrix.) 

9. Convexity, If row r is equal to the average -5 (r f +r") of 
two equivalent rows, then r > r f . 

(Two rows are equivalent, r f ~r", if r f >r"and r">r f . This 
axiom asserts that the player is not prejudiced against random- 
izing. If two rows are equally favorable, then he does not mind 
tossing a coin to decide between them.) 

Finally we will need a modified form of axiom 6 which is 
compatible with all four criteria. 

10. Special row adjunction. The ordering between the old 
rows is not changed by the adjunction of a new row, providing 
that no component of this new row is greater than the corre- 
sponding components of all old rows. 

The principal results of this section are all incorporated in 
the following diagram, which describes the relations between 
the ten axioms and the four criteria. The symbol "X" indicates 
that the corresponding axiom and criterion are compatible. 
Each criterion is characterized by those axioms which are 
marked "HP 1 . 



1. Ordering 

2. Symmetry 

3. Str. Domination 

4. Continuity 

5. Linearity 

6. Row adjunction 

7. Col. linearity 

8. Col. duplication 

9. Convexity 

10. Special Row adj. 



Laplace Wald Hurwicz Savage 



B 



B 



X 
X 
B 
B 

X 
X 



B 
B 
B 
B 



B 



Diagram 1, X = compatibility. 
Each criterion is characterized by axioms marked 
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Theorem 1. The Laplace criterion is compatible with all of 
these axioms other than axiom 8; the Wald criterion with all but 
axiom 7; the Hurwicz criterion with all but 7 and 9; the Savage 
criterion with all but 6. 

The proofs are all completely trivial. Perhaps the following 
two examples are of interest. In the first matrix the Hurwicz 
criterion (for a > 0) is not compatible with axiom 9 (convexity) 
In the second pair the Savage criterion is not compatible with 
axiom 6 (row adjunction). 



/2 0\ 
(l lOJ 

\0 2 O/ 




Theorem 2. The Laplace criterion is characterized by 
axioms 1,2,3,6,7. 

It is first necessary to prove the following. 

Lemma 1. Assuming axioms 1,2,6 (ordering, symmetry, row 
adjunction) two rows which differ only in the order of their com- 
ponents are equivalent. 

Adjoin a sequence of intermediate rows so that two consecu- 
tive rows differ only by a permutation of two components. The 
result now follows by an application of the symmetry axiom to 
each pair of consecutive rows. 

The proof of theorem 2 follows. Suppose that the average of 
the components of r equals the average of the components of r f . 
Alternately perform the following two operations on the matrix: 

a) Permute the elements of r and r f so that they are in 
order of increasing size. (Permissible by lemma 1, and axiom 6)1 

b) Subtract from each column the component in r or the com- 
ponent in r 1 , whichever is smaller. (Permissible by axiom 7). 
After a finite number of steps, all of the components of r and r 1 
will be zero. It follows that r ~ r f . 

Now using axioms 3 and 6 it follows that r > r f whenever the 
average of the elements of r is greater than the average of the 
elements of r f . Thus the criterion is that of Laplace. 

Theorem 3. The Wald criterion is characterized by axioms 
1,2,3,4,6,8,9. 

Two lemmas are first necessary. 

Lemma 2. Assuming axioms 3 and 4 (domination and con- 
tinuity), if each component of r is greater than or equal to the 
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corresponding component of r f , then r > r f . 
The proof is clear. 5 * 

Lemma 3. Assuming axioms 1,2,3,4,6,8, two rows which 
have the same minimum element and the same maximum ele- 
ment are equivalent. 

Let (aj, , a n ) be any row having the minimum component 
a and the maximum component A. From lemmas 1 and 2 it 
follows that 



(a,- -,a,A) < (aj,- -,a n ) < (A,- - -,A,a). 
But (a,- -,a,A) is equivalent to (A, , A,a) since the matrix 

f * ] * J can be obtained from the symmetrical matrix I . 1 

by column duplication. Therefore any two rows having minimum 
element a and maximum element A are equivalent. 

Proof of theorem 3. By lemma 3 it is sufficient to consider 
pairs (a, A) with a A in place of rows. Applying the convexity 
axiom (9) to the matrix 



(a |(a+A) |(a+A)\ 
a a A j 

a A a / 



we have (a, A) < (a,-g(a+A) ). By repeated application of this 

rule, together with the continuity axiom, we have (a,A) < (a,a), 
hence (a,A) ~ (a,a). It follows easily that the criterion is that 
of Waldo 

Theorem 4. The Hurwicz criteria are characterized by 
axioms 1,2,3,4,5,6,8. 

Again it suffices to consider pairs (a, A) with a ^ A. Let a 
be the supremum of all numbers a f such that 

(',<*')< (0,1). 

By the domination axiom it follows that cr 1. By con- 
tinuity it follows that (a, a) - (0,1). By linearity 

(crA+ (l-a)a, a A + (l-a)a) ~ (a,A), 

whenever a < A. It follows easily that the given criterion is just 
that criterion of Hurwicz which corresponds to the parameter 
value a. 
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Theorem 5. The Savage criterion is characterized by axioms 
1,3,8,4,7,8,9,10, 

A matrix will be called normalized if it contains a row r o 
consisting entirely of zeros, and if it contains no positive com- 
ponents . Any given matrix can be normalized by first sub- 
tracting the maximum element from each column, and then ad- 
joining the row r o . By axioms 7 and 10 these operations do 
not change the ordering relation between the old rows. In a 
normalized matrix we are free, by axiom 10, to adjoin any row 
which contains no positive elements and to delete any row other 
than r . The proof is now completely parallel to the proof of 
theorem 3. It is only necessary to require that all matrices 
considered be normalized. 



3. CRITICISM OF THE CRITERIA 

There is one fundamental principle which has not yet been 
mentioned: that of domination (or admissibility). One strategy 
is said to dominate another if it is just as good in all states of 
Nature and definitely better in at least one. It is natural to re- 
quire that the following axiom be satisfied. 

3 f . If r dominates r f then r>r ! . 

This axiom is not compatible with the criteria of Wald, Hurwicz, 
and Savage. Each of these criteria could be modified in a trivial 
way ^ so as to satisfy 3', but the result would violate the equally 
fundamental axiom of continuity. This difficulty is illustrated by 
the following two examples. 

Example 1. Consider the family of matrices 




where ^ k ^ 1. Mixed strategies are to be allowed. In the 
case k = 1 the second row dominates the first. It is therefore 
natural to expect that the second row should be chosen exclu- 
sively for k = 1, and should be chosen with high probability for 
k close to 1. But according to the Wald and Hurwicz criteria 
(a < 1) the first row should be chosen whenever k< 1. (Compare 
diagram 2). In this example the Savage criterion has the ex- 
pected behavior, but in the following, more complicated, examnl( 
the Savage criterion is also unsatisfactory. 
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Diagram 2. Probability 
of choosing second row 
(Example 1.) 







Parameter k 



Diagram 3. Probability 
of. choosing last two rows 
(Example 2.) 



Example 2. Consider the matrices 

1 
0101 
1 1 k 
,1 1 k 

where O^k^l. For k = 1 the first two rows are dominated, 
yet according to the Wald and Savage criteria these two rows 
should be chosen exclusively whenever k<l. (Compare dia- 
gram 3). Only the Laplace criterion gives a satisfactory solu- 
tion in this example. 

The Laplace criterion has been successful under all of the 
tests which have been made of it, with the single exception of 
axiom 8 (column duplication),, It appears that, if we are willing 
to sacrifice this axiom, then the Laplace criterion is definitely 
the best. However in many applications it is desirable to pre- 
serve axiom 8. This is particularly true in cases where there 
is no clear and natural separation of the possible states of 
Nature into a finite number of distinct alternatives. 

Thus all of the criteria under consideration seem unsatis- 
factory in that they fail to satisfy certain rather basic axioms. 
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4 POSSIBILITIES FOR OTHER CRITERIA 

It has become apparent that no possible criterion can have 
all of the properties that one would desire. It is therefore natu- 
ral to try constructing a list of those properties which are most 
fundamental and important, to see if at least these can be satis- 
fied. The following is the author's attempt to construct such a 
list. Others would doubtlessly have given rather different lists. 

Let S denote the simplex of mixed strategies over the rows 
of the matrix. 

I. To each matrix there corresponds a non- vacuous choice 
set C contained in S. 

(The complete ordering of 2 really gave more information than 
was necessary.) 

II. Symmetry, C does not depend on the numbering of the 
rows and columns. 

III. Domination. Every element of C is undominated ( = ad- 
missible). 

IV. Continuity. If a^ a tj , s (k) eC(a^), and s (k) s, 
then 



V. Row adjunction The choice set is not changed by the ad- 
junction of a new row which is dominated by some old row. 
(Some stronger row adjunction axiom would be desirable, but at 
least this much seems indispensable.) 

The following three axioms are also desirable, although not 
as basic as the first five. 

VI. Column duplication. C is not changed by the adjunction 
of a duplicate of some column. 

VII. Column linearity. C is not changed by the addition of a 
constant to a column. 

VIII. Convexity. C is convex. 

Evidently no criterion which has been mentioned so far 
satisfies all of these axioms. 

Theorem 6. There exist criteria which simultaneously satis- 
fy the preceding eight axioms. 

First consider the following slight modification of the Savage 
criterion. Let S be a convex polyhedron of mixed strategies 
for the player, and let pj, ,Pn be linear payoff functions on 



58 DECISION PROCESSES 

S, corresponding to the n possible states of Nature. The nega- 
tive regret is defined by rj(s,S) = pj(s) - Max pj(s'). The Savage 

S C O 

choice set C(S) consists of the set of all strategies scS for 
which Minrj(s,S) attains its maximum M. Instead we will con- 

j J 
sider the set C(S) consisting of all seS such that 

Min rj(s,S) Z M - t . 

The required criteria are now constructed as follows. Choose 
as parameters an infinite sequence of positive numbers 
\>2> " * " which converge to zero. Define the sets SQ ^SjD 
by SQ = S, Si = C^S^i). As choice set C(^ 2> . . .)(S) we 

take the intersection of the Sj. 

The axioms I through VIII may now be verified. The proofs 
will not be carried out, since they are rather involved (at least 
for domination and continuity). In any case these criteria are 
probably too difficult computationally to be of practical interest. 

A further interesting property which is possessed by these 
criteria is the following. The n payoff functions are all constant 
on the choice set. Thus any two elements in the choice set are 
completely equivalent. 

It is interesting to ask if there exist any simple, computable 
criteria which satisfy all of the preceding conditions. 



FOOTNOTES 

1. Suggested by L. Hurwicz in an unpublished paper. 

2. For simplicity, only pure strategies for the player are con- 
sidered in this section. However the results can easily be gen- 
eralized to the (more natural) case where mixed strategies are 
allowed. 

3. Lemma 2 suggests the following criterion. Define r^>r f if 
and only if each component of r is the corresponding compo- 
nent of r f . It may be shown that this criterion satisfies all 
axioms except 1, and is characterized by 2,3,4,6,7,8, together 
with the transitivity portion of 1. 

4. Let r be preferred to r f (in the modified sense) if either 
r>r f in the old sense (of Wald, Hurwicz, or Savage) or r 
dominates r f . 
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CHAPTER V 

NOTE ON SOME PROPOSED DECISION CRITERIA* 

by 

Roy Radner and Jacob Marschak 
COWLES COMMISSION FOR RESEARCH IN ECONOMICS 

1. SUMMARY 



The purpose of this paper is to apply two currently advocated 
statistical decision procedures to a simple problem and show 
that they result in solutions that have certain undesirable prop- 
erties. Each of the two procedures is a generalization or inter- 
pretation of the minimax principle. The problem consists of a 
game in which an individual observes and bets on the outcomes 
of tosses of a coin with constant but unknown probability of fall- 
ing heads. 



2. INTRODUCTION 

2.1. The Rational Decision-Maker. In this discussion we 
shall consider an individual decision-maker who is rational in 
the following sense: if he can specify a set of "states of nature" 
such that for a given state n and a given strategy he knows the 
probability distribution of outcomes, then he will always 

(1) choose some admissible strategy (when possible), 1 

(2) choose the strategy so as to maximize his expected 
utility, if he knows the true state of nature. 

Let U(s,n) be the expected utility when n is the true state of 
nature and the individual uses strategy s. A strategy s is ad- 
missible if there is no other strategy s such that 

U(s,n) ^ U(s ,n), for all n, and 



*This research was carried out under contract with the Office 
of Naval Research. We are indebted to E. Lehmann for several 
valuable suggestions and discussions. 
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U(s,n ) > U(s ,n ) , for some n o . 

2.2. The Minimax Principle, If the individual does not know 
the true state of nature then, in general, the criterion of admis- 
sibility will not be sufficient to enable him to choose a strategy; 
thus some further criteria are needed. Such a criterion is 
Wald's "minimax principle. 11 (Cf. [8], p. 18) One interpretation 
of this principle is: "Minimax the negative expected utility," 
i.e., choose s to achieve 

min max [-U(s,n)] . 
s n 

This interpretation has been attacked by many as too pessimis- 
tic (cf. for example f?], p. 63), and it is this undesirable prop- 
erty which has, in part, led to the proposal of two alternative 
criteria which we will now consider. 

2.3. The Hurwicz Criterion. The first of these, which might 
be considered as comprising a whole class of criteria, including 
the minimax principle as just stated, is a generalized form of a 
criterion proposed by L. Hurwicz [3], In this generalized form 
it requires that a strategy be chosen which maximizes: 

(1) H(s) = (Sup U(s,n), tnf U(s,n) ) 

n n 

where is some fixed monotone increasing function of each of 
its two arguments. itself is chosen by the decision-maker 
and in some sense characterizes his attitude towards uncertainty. 
A special case (the one actually suggested by Hurwicz) is 

(2) = a Sup U(s,n) + (1-a) Inf U(s,n) 

n n 

where a is some fixed number between and 1. Here a might 
be regarded as a degree of optimism (cf. [4], p. 344). 

We will present an example in which application of the Hur- 
wicz criterion leads to the conclusion that at most one observa- 
tion should be taken in a situation in which common sense de- 
mands that a large number of observations be taken. 

2.4. The Minimax Regret Criterion. A different direction is 
taken by L. J. Savage, J7J, who gives good reason why Wald 
could not have considered negative expected utility as the appro- 
priate thing to minimax (cf. Wald [8], p. 8). Instead, Savage 
says the proper interpretation of the "minimax rule" is: 

"Choose that strategy which minimizes Sup R(s,n) where 

n 
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(3) R(s,n) = Sup U(s',n) - U(s,n) . 

s r 

We will call R(s,n) the regret function. 2 Chernoff, [2], has 
criticized this principle because there are cases in which, if 
the domain S of the player's available strategies is enlarged, 
a new minimax regret solution is obtained which differs from 
the old one, yet is contained in the original S. (This is not sur- 
prising, since the value of R(s,n) for any pair (s,n) depends 
upon the domain S. Note that this is not true of H(s). ) 

It is interesting to note that the idea behind Chernoff f s objec- 
tion has analogues in Nash's treatment of the bargaining problem 
(postulate 7, p. 159 of [5]) and Arrow's discussion of social 
welfare functions (Condition 3, p. 27 of [l]). Borrowing Arrow's 
terminology we shall say that in the kind of cases described 
above the minimax regret solution is "dependent upon irrelevant 
alternatives,," 



3. A NON-SEQUENTIAL GAME 

3.1. General Description. Consider the following game: The 
player observes an odd number (2k + 1) of tosses of a coin with 
a constant but unknown probability p of falling heads (and q = 
1 - p of falling tails), whereupon he makes a bet on the outcome 
of the next toss, wins one dollar if his prediction is correct and 
loses one dollar if incorrect. Each toss costs the player c 
dollars, and he must decide in advance the (odd) number of 
tosses he will observe before betting. The player is also free 
not to enter the game at all. This last possible decision we will 
call the null strategy. Aside from it, any pure strategy of the 
player consists of a number k, which determines that he will 
bet after 2k + 1 tosses, and a rule r, which determines for 
every set of observations (sample) which way he will bet A 
mixed strategy is a probability distribution on the set of pure 
strategies (k,r). 

We shall see that for both types of solutions there is an op- 
timal rule r m which requires that the player bet with the 
majority of previous tosses. This will be called the maximum 
likelihood rule. It will be shown that the Hurwicz solution has 
the property that for any (positive) cost c, and any 0, no more 
than one observation should be taken. In the special case of a 
linear [equation (2)], the solution is: if a 2c, bet after one 
observation, if a 2c do not play. 



64 DECISION PROCESSES 

The minimax regret solution is of the form: randomize be- 
tween two adjacent values of k, these values being certain non- 
increasing functions of c; if c is less than a certain quantity, 
randomize between one observation and not playing. However, 
if we modify the game by compelling the player always to use 
the maximum likelihood rule, the optimal number of observa- 
tions will be seen to differ from that in the solution of the more 
general game. 

3.2. Hurwicz Solution. For any non-null strategy the ex- 
pected gain cannot be more than 1 - c(N + 1) where N is the 
expected number of observations, since this is the gain if the 
prediction is correct with certainty. On the other hand, for any 

non-null strategy the expected gain for P = ^ is -C ( N + *) hence 

the minimum cannot be more than -c(N + 1). 

If the player uses the maximum likelihood rule, then the ex- 
pected gain is exactly 1 -c(N + 1) for p = or 1, while it is 
never less than -c(N + 1). (If the reader is not immediately 
convinced of this latter statement he can examine the expected 
gain function in more detail in the next section.) Moreover, 
given that a non-null strategy is used, the smallest possible 
value of N is 1. Hence among non-null strategies both 

Sup U(n,s) and Inf U(n,s) are maximized by using the maximum 
n n 

likelihood rule and taking one observation, and no matter what 
the and c the optimal procedure will have the property that 
no more than one observation is taken. If has the linear form 
of (2) and if s is the strategy which consists of using the maxi- 
mum likelihood rule after one observation with probability v 
and not playing with probability 1 - v, then 

H(s ) = a v(l - 2c) + (1 - a) v(-2c) 
= v(a - 2c) . 

Thus H(s ) is maximized by taking v equal to 1 or according 
as a 2c or a 2c. 

3.3 Minimax Regret Solution. This solution is not so easily 
obtained as the one imposed by the Hurwicz criterion, and we 
will only sketch the method of arriving at it. 

Let d(k,r) denote a joint probability distribution of k and the 
rule r and let d(r|k) be the conditional distribution of r given 
k. 
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Denote the player's expected money gain using d(k,r), given 
p, by U(d(k,r),p); the expected money gain using d(r|k), given 
k and p, by U k (d(r|k),p); and the null strategy by k = -1. Then: 

(4) U(d(k,r),p) = f(k)U k (d(r|k),p). 

k=-l 

Let hi and t i be the events of getting i heads and i tails re- 
spectively with respective probabilities hi(p) and tj(p) 
(i = 0, , k + 1). Any d(r |k) is a rule of the form: 

"For given k, if hi, bet on heads with the probability 
and if ti bet on tails with probability T^." 

The maximum likelihood rule r m is defined by /^i = Li = * 
If d(r|k) differs from r m , it will do so exactly on certain 
events hj (j in J) and t^ (/in L). It is easily verified that: 

(5) U k (r m ,p) = (p - q) 2 [^(p) - t^p)] - 2(k + l)c 

(6) U k (d(r|k),p) = U k (r m ,p) 

+ 2(p - q)[z t^pXl - ^) ~ Jhj(p)(l - ^)]. 

It is not hard to show that we can reject as inadmissible all 
strategies such that there is some k (with f (k) ^ 0), for which 
J and L are not disjoint. The set of remaining strategies we 
will call S. 

We want that d(k,r) which minimizes the supremum, with 
respect to p, of the regret: 

R[d(k,r),p] = U(p) - U[d(k,r),p], where 

U(p) = Sup U[d(k,r),p] . 
d(k,r) 

U(p) is attained, for every p, if the player bets on heads 

when P ^ 9' on tails when p ^ 2 and pays as sma11 a cost as 
possible (i.e., k = 0) provided the resulting expected gain is 
positive; otherwise it is attained by not playing. Thus: 

U(p) = max ||p - q|- 2c 9 0} . 

Let h(k,p) and t(k,p) be the probabilities of majorities of 
heads and tails respectively. 

Then for a strategy using the maximum likelihood rule, the 
regret is: ^ 



P(f,p) = jftopkfp) where, for k > 0, 
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2(q-p)h(k,p) + 2kc, (q - p) 2c 
p k (p) = 2(k -i- l)c - (p - q)(h(k,p) - t(k,p) ), p - q S 2c 

2(p - q)t(k,p) + 2kc, (p - q) 2c 
and 

P-l(p) = U(p). 

The function p(f,p) is symmetric in p, for all f, and has a 
maximum at two points, say pi and qj = 1 - pj, if c < c ; or 

at 7:, if c c , where 

5 + c< P1 <1 
and c is defined by 



00 



c = 



i.e., C Q is the cost per observation for which the three relative 
maxima of p(f,p) are equal. 

Next, it can be shown that one of the mini max regret strate- 
gies uses the maximum likelihood rule. The important step in 
the proof of this point is the fact that (when c < c ), if the 
regret for some strategy s at p = pi is less than P(f,pj) then 
at p = qj it is greater than p(f,qi), and vice versa. 

It remains now to find the optimal distribution f of k, when 
the maximum likelihood rule is used. 

Although Pfe(p) is defined only for integral values of k, it is, 
for every fixed p, analogous to a convex function of k, in that 
for every integer k (and fixed p): 

p k+l(P) ' Pk(P) * Pk+2<P) ' Pk+l(P)- 

It is shown in [6] that in such a case the only admissible strate- 
gies (using the maximum likelihood rule) are those such that 
f (k) is concentrated on at most two consecutive integers. Since 
such a distribution is determined by its mean we can express 
the solution by a single number k, which will be a function of c. 
The approximate value of this function k(c) has been deter- 
mined numerically for several values of c, and the results are 
given in Table 1 below. ' 
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3.4. Dependence on "Irrelevant Alternatives. " We shall now 
show that in this game the minimax regret solution "depends 
upon irrelevant alternatives. " 

Suppose we modify the above game by requiring the player to 
use the maximum likelihood rule. We proceed to obtain the 
minimax regret solution for this case. 

The expected gain using f(k) is given by (4) and (5). Again 
the negative of Ufc(r m ,p) is convex in k for every p, in the 
sense described above, and the only admissible f f s are those 
which are zero at all but at most two consecutive values of k. 
s in Section 3.3, we have obtained the value of the function 

), describing the optimal strategy, for various values of c 
The results are given in Table 1. 



Cost c 


.001 


.002 


.005 


.010 


.020 


.050 


k for General Strategy Domain 


13.8 


8.9 


4.3 


2.4 


1.2 


0.2 


K for maximum likelihood 


9.1 


5.4 


2.6 


1.8 


.7 




Strategy Domain 















Table 1. Minimax Regret Solutions for 2 Strategy Domains 

We recall that the two games considered differ only in that in 
the first game the player is free to use strategies which do not 
incorporate the maximum likelihood rule, while in the second he 
must use that rule. Nevertheless, in the first game the optimal 
strategy is shown to use the maximum likelihood rule, but with 
a different number of trials 2k + 1. 



4. A SEQUENTIAL GAME- THE HURWICZ SOLUTION 

It is worthwhile pointing out 4 that the essential feature of the 
Hurwicz solution in Section 3.2 carries over to a sequential gen- 
eralization of the first game. That is, if we allow the player to 
decide when he will make his bet after having seen any number 
of observations, it remains true that any optimal strategy will 
not involve taking more than one observation. The proof of this 
for general is practically the same as that for the non- 
sequential game. 
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FOOTNOTES 

1. In our examples there will always be admissible strategies,, 

2. Savage calls this the "loss function" but economists and 
others are liable to confuse this with negative income. 

3 Computations for this and the following section were made 
under the direction of J. Templeton and W. Parrish. 

4. We are indebted to E. L. Lehmann for doing so to us. 
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CHAPTER VI 



SOCIAL CHOICE AND STRENGTH OF PREFERENCE 4 

by 
Clyde H. Coombs 

UNIVERSITY OF MICHIGAN 



I. NATURE OF THE PROBLEM 

The problem of constructing a mechanism to aggregate the 
preferences of individuals into a social preference pattern has 
been of concern to psychologists only implicitly. Papers ex- 
plicitly concerned with the problem are usually found in the 
literature of economics and mathematics. Psychologists, how- 
ever, in dealing with their measurement problems, have built 
formal mechanisms which, while never mentioning "social utili- 
ty, " actually constitute mechanisms for merging the preferences 
of the individual members of a group. 

The nature of the problem of defining a social utility is de- 
scribed by Arrow in his recent monograph on Social Choice and 
Individual Values [l]. Consequently, the problem will merely be 
illustrated here by a hypothetical setting in which it might arise. 
Imagine a group of individuals who are all members of an art 
society which has money in its treasury to purchase some 
paintings. Each member of the society has his individual pref- 
erences among the paintings available in the market. The 
strength of an individual's preference for a painting will be re- 
ferred to as the utility of an individual for a painting. 

The problem facing the society is to arrive at a mechanism 
for merging the utilities of its members for the various paintings 
in order to obtain a social choice giving a decision as to which 
paintings are to be purchased. Because paintings vary in price 
it is necessary that the social preference pattern be simply or- 
dered in order that the purchasing agent be completely instructed, 
regardless of the state of the market. 



*This paper was prepared as part of a research project under 
Contract Nonr: 37400 with the Office of Naval Research. I am 
indebted to my research assistant, Mr. J. E. Keith Smith, for 
invaluable assistance throughout this research. 
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Defining a social choice function always involves, implicitly 
or explicitly, two value judgments: 1) the relative weight to be 
assigned to each individual in the society and 2) the relative 
weight to be given to each preferential vote. These value judg- 
ments are sometimes explicit in real mechanisms and always 
explicit in formal mechanisms. The social scientist is inter- 
ested in making such value judgments explicit because they re- 
flect the culture of the group. Also a group is in a better posi- 
tion to objectively select a real mechanism for arriving at a 
social choice function when the alternative mechanisms are 
visible and their respective value judgments explicit. 

There are two ways in which this problem may be studied: 
empirically and theoretically. These two approaches correspond 
respectively to the left and right side of Figure 1 in "Some Views 
on Mathematical Models and Measurement Theory" (Chapter II). 
The empirical approach would be concerned with the nature and 
characteristics of the social utility arrived at by group processes 
of different kinds as for example the group decision of a jury in 
which there is vocal interaction and unanimity is required, as 
compared with the election of a president of a professional so- 
ciety by a process of preferential voting. With these charac- 
teristics as a starting point a theoretician would be interested 
in constructing a formal model, i.e., specifying the axioms, 
which would deliver such a choice function. 

The theoretical approach to the problem of defining a social 
utility would start out with a set of axioms designed to arrive at 
a social utility with certain characteristics. This would then 
constitute an "available" mechanism for a group to deliberately 
select for the purpose of arriving at a social utility and it might 
be a mechanism which describes the process already being used 
by some groups. 

The direction of approach in this paper is a theoretical one, 
that of constructing a social utility which will have certain char- 
acteristics. The particular problem which has intrigued the 
writer is the problem of constructing a social utility which will 
weight preferential judgments by their "strength." 

This problem is difficult because comparability of utility 
measures between individuals is required. Arithmetically 
averaging strengths of preference over individuals requires the 
assumption of the existence of a common unit of measurement 
for utility and then an actual numerical estimate of it. We shall 
here define a social utility which will contain the assumption of 
the existence of a common unit of measurement for utility be- 
tween individuals, but no numerical estimate is required. 



SOCIAL CHOICE AND STRENGTH OF PREFERENCE 71 

Furthermore, certain of the conditions which data must satisfy 
for the assumption to hold will be verifiable. 

The social utility to be constructed here is derived from a 
theory of psychological scaling published elsewhere ([5], [6]) so 
only certain necessary elements of the scaling theory will be 
presented. To illustrate the stringent conditions which data 
must satisfy to permit construction of a social utility which 
weights strength of preference, an experiment was run and will 
be utilized for constructing such a social utility. These same 
data will provide an empirical basis for comparing this social 
utility with two other mechanisms commonly used by psycholo- 
gists for the same implicit purpose. For all these social utili- 
ties, individuals will be weighted equally. 



II. EXPERIMENTAL SETTING 

The experiment conducted to illustrate the problem and the 
scaling theory is the following. The objets d'art to be judged 
consisted of a series of isosceles triangles with a base of one 
inch, which varied in altitude from .25 to 2.5 inches in steps of 
.25 inches. The individuals were presented the triangles in sets 
of three and asked to judge the most preferred and least pre- 
ferred in each set. Every individual was presented with every 
set of three triangles. There were ten triangles, so there were 
120 triads and there were 31 individuals making the judgments. 
The presentations were randomized with respect to the fre- 
quency with which any given triangle occurred in each position 
within triads and with respect to order in the sequence of triads. 

Analysis of the data was somewhat different for each of the 
social utilities to be illustrated. A general scaling theory, in 
the context of which each of these social utilities will be com- 
pared, will next be discussed. 



III. THE UNFOLDING TECHNIQUE 

As much of the material required by this paper on the subject 
of the basic scaling theory has now been published ([5], [6]) only 
a brief sketch of that part of it which pertains particularly to the 
problem of defining a social utility will be discussed. 

When an individual makes a preferential choice between two 
or more stimuli, the individual will be conceived as possessing 
an ideal and the stimuli, the objects being judged, are evaluated 
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with respect to their relative distance from this ideal. In the 
context of this experiment, an individual is conceived of as 
having an "ideal" triangle, at any given moment, which would 
be preferred to all others. This ideal would correspond to a 
point on a continuum of "altitudes of isosceles triangles with 
one inch bases." This point will be designated the C value of 
an individual with the subscripts h, i, and j designating re- 
spectively the moment in time, the individual, arid the stimulus 
being evaluated. 

The stimulus is also conceived as possessing some degree 
of this same attribute, related possibly to its particular alti- 
tude, and this magnitude will be designated its Q value and will 
also have the subscripts h, i, and j. The difference between 
the Q value of the stimulus and the C value of the individual, 
his ideal, will be defined as his utility for the stimulus, |Phij| 
Thus, in a one dimensional case: 



= |Qhij 



- C hij 



and the postulate which defines the information in a comparative 
preferential judgment between two stimuli, j and k, may be 
written as follows: 

(2) |P hij | ^ |Ph ik | if and only if j >k 

where the symbol > signifies the verbal response "preferred 
to". 

Equation (1) constitutes an observational equation of prefer- 
ence^ and the information obtained in an observation is given by 
equation (2). The left side of equation (1) will be referred to as 
the phenotypic level of behavior corresponding to the manifest 
observed behavior, the right side of the equation as the geno- 
typic level, an inferred, hypothetical level of description. A set 
of preference data consists of a number of simultaneous equations 
with only certain order relations given for the terms on the left 
hand side. The problem is to go from the information given on 
the left hand side of the equation to the inferences that can be 
drawn about the quantities on the right hand side, constituting 
a solution to the simultaneous equations. 

If there is a single common latent attribute underlying or 
generating the preferences of the individuals, the analysis of the 
data immediately yields this attribute in the form of order rela- 
tions on the Q values, order relations on the C values, and in- 
formation on metric relations. 
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Consider the following illustration. Suppose there were a 
single latent attribute with the stimuli (A, B, C, D, E) on it as 
in Figure 1, and the judges 1 ideals distributed over the entire 
range. 
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Fig. 1 



This will be called a J scale or Joint scale because it has 
both people and stimuli on it. When an individual makes a 
preference judgment between any two stimuli, then, according 
to equations (1) and (2), he will prefer that stimulus which is 
nearer his ideal. If an individual says he prefers stimulus A 
to stimulus B, this indicates that at the time of that judgment 
his ideal was located to the left of the midpoint between these 
two stimuli. Similarly, if he says he prefers B to E, he is 
located to the left of the midpoint BE. Hence, the midpoints of 
all possible pairs of stimuli partition the continuum into seg- 
ments. If an individual in the course of making all his paired 
comparison judgments is located in one of these segments, then 
these judgments would be consistent and transitive and could be 
completely represented by a rank order of the stimuli. 

This rank order of the stimuli is called the I scale of the 
individual and each I scale corresponds uniquely to a segment 
of the J scale bounded by an adjacent pair of midpoints (except 
for the two end segments). An I scale may be looked upon as 
the J scale folded at the ideal of an individual with the stimuli 
ranked in order of increasing distance away from the individual. 
The data consist, then, of these I scales, and the J scale is ob- 
tained by unfolding them. This is the reason for the name, Un- 
folding Technique. 

In order to unfold a set of I scales it is necessary first to 
order them. It is immediately apparent that every I scale in a 
set ends in one of two stimuli, these being the first and last on 
the J scale. Also, every complete set contains two I scales 
which begin with one of these two stimuli and end with the other 
and these two I scales are mirror images of each other. These 
immediately provide a simple ordering of the stimuli on the J 
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scale. The remainder of the I scales are ordered between the 
first and last by the simple rule that every stimulus must first 
move to the left end of an I scale before it can move to the 
right. Two adjacent I scales then differ in that two adjacent 
stimuli are in different order. This reversal indicates the mid- 
point that was crossed in passing from one to the next. 

Given the order of the stimuli on the J scale there is a nec- 
essary partial order on their midpoints. For example, the mid- 
point BC necessarily precedes the midpoint BD. The data, 
however, yield a simple order on these midpoints and it is this 
fact which provides metric information. The hypothetical data 
in Table 1, based on the J scale of Figure 1, indicate that the 
midpoint AD precedes BC. This will occur if, and only if, the 
distance between stimuli A and B is greater than the distance 
between the stimuli C and D. If the reverse were true, then 
the midpoint BC would precede AD and 14 would have been 
CBADE. 
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Table 1. 



The I scales which lie between the first three and last three 
in a set are the ones which provide metric information. The 
metric information contained in these data, where n = 5, are 
given in Table 1 and illustrated by the partial order in Figure 2. 
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It can readily be shown that 
metric relations on the magnitudes 
of the intervals bounded by the 
midpoints of pairs of stimuli, fol- 
lows from information about the 
metric relations on the distances 
between stimuli. The results of 
such an analysis are contained in 
Table 2 which gives the informa- 
tion on how large each interval is 
in terms of the distances between 
stimuli, for this example. Figure 3 
gives the ordering of these intervals in relative magnitude. 




Fig. 2. 
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This hypothetical data has been presented in order to provide 
some general background on the Unfolding Technique and to in- 
dicate some of the conditions the data must satisfy to provide 
experimental evidence for inter- individual comparability of 
utility. 



IV, A SOCIAL CHOICE FUNCTION BASED 
ON STRENGTH OF PREFERENCE 

Let us now look at this model from the point of view of con- 
structing a social utility. The mechanisms usually used for such 
a purpose involve the relative frequencies of the paired compari- 
son judgments. In other words they deal with the phenotypic 
level of behavior. It is proposed here to construct a social 
utility on the inferred genotypic continuum. 

A social utility may be constructed based on the value judg- 
ments of giving equal weight to each individual, but weighting 
each preference by its "strength, 11 defined by equation (1). Such 
a social utility would be the arithmetic mean of the individuals 1 
utilities for each stimulus. Thus, the social utility for stimulus 
j would be: 



These quantities cannot be arithmetically determined because 
the preferential judgments do not give the absolute magnitude of 
the strength of the individual's utilities for each stimulus but 
just their relative magnitude. Given certain conditions, however, 
these quantities, |p| . .j, can be simply ordered over j. 

The conditions required are the following: 

1) The preferential judgments must satisfy the conditions for 
a single latent attribute, i.e., the I scales must unfold into 
a single quantitative J scale. 

2) The judges must be distributed symmetrically on the J 
scale. 

3) The existence of a common metric on this J scale must 
be assumed. 

The first is a condition which perhaps can be relaxed and 
generalized to multidimensional latent attributes. The frequen- 
cies of the I scales in combination with the metric relations will 
provide some information relevant to the second condition. The 
third is a fundamental assumption which appears to be unavoidabl 
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if one requires that a social choice function weight the strength 
of preferences. 

With these assumptions, it follows that the quantity, | P| . .j, 
the social utility of stimulus j, is the arithmetic mean of the 
absolute deviations of the individuals about stimulus j as an 
origin. In other words, the social utility of a stimulus is the 
mean deviation of the individuals about stimulus], as the origin. 
From statistical theory, we know that the mean deviation will 
be a minimum about the median and will be increasingly large 
as the origin deviates from the median. Hence, the median in- 
dividual will be that origin about which the quantities |P|. . j 
will be a minimum for any j. So, the rank order I scale of the 
median individual in a distribution will be the rank order of the 
minimal values of |p|. . j over j and will constitute a social 
utility which has the characteristic of being the least disliked. 
If the data satisfy the necessary conditions, which are very 
strict, a simply ordered social utility which weights strength 
of preference is then obtained. 

These assumptions are very stringent. For example, if the 
conditions are satisfied, then if one individual's ideal is A and 
he dislikes B very much, and another individual's ideal is B, 
it necessarily follows that the second individual dislikes A very 
much, and in fact his dislike for A must be equal to the first 
individual's dislike for B. 

Leo Goodman** has brought to the writer's attention the fact 
that Duncan Black [4] has pointed out that if the condition of 
single peakedness is satisfied (implied by equations (1) and (2)), 
then the first choice of the median individual is, in fact, the 
most preferred stimulus as obtained by the system of majority 
voting. 



V. RESULTS OF THE EXPERIMENT 

Real data usually do not result in a set of I scales which will 
unfold into a single quantitative J scale. This scaling theory is 
very vulnerable; it does not necessarily yield unidimensionality, 
for example, but rather sets up conditions which the data must 
satisfy in order to draw the inference of a unidimensional latent 
attribute. It is to be expected that judgments of individuals will 
contain error. Even in the case of a single latent attribute one 
can conceive of an individual vacillating or being uncertain as 
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to precisely where he stands on the continuum. Over a number 
of judgments, an individual may actually have a distribution of 
C values or ideals and, for judgments at different times, be 
located at different places in such a distribution. 

By using a method of collecting data which requires replica- 
tion of a paired comparison, as does the method of triads, one 
can determine the percentage of times an individual's ideal was 
on either side of a given midpoint. This permits a partial re- 
construction of an individual's distribution of ideals. 

This experiment on the aesthetics of isosceles triangles in- 
volved 10 triangles. Hence, there are 45 midpoints and the con- 
tinuum is divided into 46 intervals corresponding to a set of 46 
I scales. 

The ten triangles used in this experiment differed in altitude 
by increments of .25 inches, so the entire range of differences 
in altitude was 2.25 inches. This continuum, being broken into 
46 intervals, means that on the average these intervals repre- 
sented a physical range of .05 inches in altitude. It is perhaps 
to be expected then that individuals would vary over a number 
of adjacent intervals unless they could discriminate altitude 
very well and knew precisely what altitude they most preferred 
and didn't vary from it. It was found that practically all the 
subjects did vary on the continuum. Hence, for any pair of 
stimuli, the individual's preference was taken to be that member 
of the pair which was most frequently preferred. This would 
correspond in an ideal case to the median position of his distri- 
bution on the J scale. 

This experiment was conducted with students in two graduate 
courses at the University of Michigan, a total of 31 students. 
This is clearly not enough subjects to have some in every inter- 
val of the J scale, but the results will serve as an illustration 
of the Unfolding Technique and the problems one has in arriving 
at this social utility from real data. 

The preferential judgments of the individuals are summarized 
in Table 3. In the first column are the intervals numbered from 
1 to 46. In the second column are I scales obtained in the data 
which all satisfy a common quantitative J scale with the stimuli 
in order from A to J. Each occurred with a frequency of one. 
In the next column are listed alternative I scales all of which 
satisfy the same qualitative J scale, with the stimuli ordered 
from A to J, but the metric relations implied are not the same 
as those implied by the I scales in the first column. In the col- 
umn headed M Others" are listed those I scales which do not 
even satisfy the same qualitative J scale. 
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Int. 
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Table 3 
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If we look at the seventeen cases in the first column (the only 
ones which appear to satisfy the conditions for arriving at our 
social utility) the I scale of the ninth individual would constitute 
a simply ordered social utility which weighted strength of pref- 
erence. This is the I scale in the twenty-fifth interval, 
FEGDHCBIJA. 

This mechanism for achieving a social utility has certain 
serious limitations. While on the one hand, some of us may find 
it "satisfying" to construct a social utility which weights the 
strengths of preferences, this has only been done for the special 
case in which there is a single latent attribute underlying indi- 
viduals* preferences, and with strong additional assumptions. 
While some of the problems involved in extending this mecha- 
nism to cases with multidimensional latent attributes have been 
solved [3], additional problems remain, such as, for example, 
determining a median point in a multidimensional space [7] and 
also criteria for unfolding. 

Let us turn next to another mechanism for arriving at a 
social utility. 



VI. A SOCIAL UTILITY WITH VOTES WEIGHTED EQUALLY 

In a study by Austin and Sleight [2] on the aesthetics of isos- 
celes triangles, they were interested in the so-called golden 
ratio of altitude to base, 1.62 to 1, which is supposed to be the 
most aesthetically pleasing. They used twelve triangles, 52 
subjects and the method of paired comparisons. They deter- 
mined the per cent of times each triangle was preferred to some 
other, and thereby, though not explicitly, constructed a social 
utility. The results they reported are illustrated in Figure 4, 
copied from their article. The rank order of the triangles on 
the social utility so constructed can be read from the figure and 
is EFDGCHIJBKLA. (The stimuli A through J have the same 
designations as in the experiment reported previously; stimuli 
K and L are two additional triangles they used, with altitudes 
of 2.75 inches and 3 inches, respectively.) 



SOCIAL CHOICE AND STRENGTH OF PREFERENCE 81 



80 
70 
60 



<D 50 

o 

*0 

40 



30 



20 



10 



0.25 0.50 0.75 1.00 1.25 1.50 1.75 2.00 2.25 2.50 2.75 3.00 
Altitude to Base Ratio (Base = 1 inch) 

Fig. 4. Austin-Sleight Data (N = 52) 



Their data are not reported in such a manner that the Un- 
folding Technique could be applied to permit constructing a 
social utility which weighted the individual preferences by their 
strength. It is evident, however, that their social utility may 
also be obtained by folding the J scale in the neighborhood of 
stimulus E, and in fact is from the 26th interval of a J scale 
with the stimuli going from A to L. 

The value judgments underlying their social utility are the 
following: 1) every individual is given an equal voice; 2) each 
preferential judgment (dyadic) is given equal weight. Such a 
social utility appears on experimental grounds to be a folded J 
scale and on theoretical grounds is sensitive to both the metric 
relations and the distribution of judges on the J scale. It is not 
yet clear whether these two effects on the social utility can be 
distinguished, to permit unfolding it into a J scale with metric 
relations. 

The data from the first experiment reported here may also 
be used to construct a social utility based on the same value 
judgments as the Austin-Sleight social utility. The result of this 
is a social utility with the stimuli in the order FEGDHCIBJA, 
which is one of the possible I scales which could occur in the 
26th interval of the J scale. 
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They make an observation about their data which is a very 
significant one. They observed that very few subjects had their 
strongest preference for the stimuli which were highest on the 
social utility. Most of the best-liked triangles were at the ex- 
tremes. While many subjects preferred most the triangles at 
one extreme, many other subjects had a definite dislike for 
these same triangles and made first choices at the other ex- 
treme. When the data are combined into a social utility, the 
"average" preference for the extreme triangles drops. The 
result is that the social utility constitutes not necessarily the 
order in which the triangles are most preferred, but rather the 
order in which they were least disliked. The triangles in the 
middle of the J scale were not disliked by any of the subjects 
and these are the ones which come out highest on the social 
utility. This might provide a general characterization of the 
social utilities arrived at by the political mechanism of democ- 
racy when there are more than two stimuli to be ordered. 

In the context of aesthetics, the so-called golden section ratio 
of altitude to base, 1.62 to 1, is not necessarily the most aes- 
thetically pleasing, but rather is least aesthetically displeasing 
over a number of individuals. These are by no means neces- 
sarily the same thing. 



VII. THE SOCIAL UTILITY OBTAINED FROM THE LAW 
OF COMPARATIVE JUDGMENT 

We have now examined two social utility mechanisms based 
on different value systems and it will be of interest to examine 
a third which is widely used by psychologists. In 1927, 
Thurstone published his "Law of Comparative Judgment" [8] 
as a mechanism for constructing a scale from paired compari- 
son judgments. Insofar as the formal and computational char- 
acteristics of the Law of Comparative Judgment are concerned, 
there is no distinction between judgments as to which of two 
stimuli is preferred and which of two stimuli possesses more 
of some property. From the point of view of the Unfolding Tech- 
nique, however, these are distinctly different. It is only the first, 
pertaining to judgments of preference, which provides another 
mechanism arriving at a social utility. 

In the notation of the Unfolding Technique the postulate under- 
lying the Law of Comparative Judgment is that given by equa- 
tion (2) which for this purpose can be put in the more convenient 
form: 
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(4) |P hij | - |P hik |< if and only if J >k 

If each of a number of individuals makes a judgment of pref- 
erence between each pair of stimuli, the percentage of times 
j > k is obtained. This corresponds, by the postulate, to the 
percentage of times the difference, (Phijl " | p hik|> was less 
than zero over h and i. 

The distribution of these differences has mean |P|. . j - 

I P|. k anc * variance or ,^ , . . Making the judgment 

|*r -j " I F I k 

that these differences are normally distributed, then we may 
write: 

(5) |P|..j-|P|-- k =X jk "IPI..- 



where Xj k is the normal deviate of a difference of zero. For 
each pair of stimuli (j, k), an observational equation may be 
written as above. The set of simultaneous equations so obtained 
is insoluble, as there are always more parameters than equa- 
tions. Thurstone's Case V of this Law of Comparative Judgment 
eliminates the majority of the parameters by assuming that the 
variance of the differences is constant for all pairs (j,k). It may 
then be set equal to one and used as the unit of measure. Thus, 

one has (2) experimentally independent observations and solves 

for n - 1 parameters: a scale value for each stimulus, with one 
set equal to zero to provide an origin. 

When this procedure is applied to preference data, a numeri- 
cal scale is obtained which may be regarded as the attribute 
M preferability M and the stimuli are points on this scale. 

It is of interest, now, to regard this mechanism as one for 
arriving at a social utility. From this point of view, it is evi- 
dent that the value judgments in this mechanism are as follows: 

1) each individual is given an equal voice; 

2) a preference judgment is weighted monotonically with its 
popularity. 

The latter statement arises from the fact that the normal curve 
is used to obtain the quantity Xj^ in the observational equations. 
As the percentage of judgments for one stimulus over another 
deviates from 50%, the normal deviate for a zero difference 
moves increasingly farther from the mean of the distribution 
and the increment in this distance attributable to each vote in- 
creases, on the average. 
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The data obtained in the experiment reported here on pref- 
erences for triangles could be used to secure this social utility 
also. Because the Method of Triads was used to collect the 
data instead of the Method of Paired Comparisons, each indi- 
vidual replicated each judgment eight times. So the eight re- 
sponses of each of the seventeen judges were tabulated to de- 
termine the proportion of times one stimulus was preferred to 
another. The social utility so obtained is given in Figure 5. It 
may be noted that the order of the stimuli on the social utility 
is the same for the Law of Comparative Judgment solution and 
for the social utility secured by weighting each vote equally. 
This will not always be the case but will be dependent upon the 
distribution of the judges' ideals on the J scale. 






Fig. 5 Law of Comparative Judgment Solution 
to Triangle Data. 

It is evident from this development that the social utility 
arrived at by Austin and Sleight has some of the characteristics 
associated with the judgment that the distribution of differences, 
| p hij I " l p hik| > is rectangular in contrast to normal. It has 
probably been noted that the designation of the shape of this dis- 
tribution of differences, both here and for the Law of Compara- 
tive Judgment, was referred to as a judgment and not as an 
assumption. The reason for this is that in the context of defin- 
ing a social choice function, there is no natural M reality M but 
one which is constructed by the decisions and value judgments 
of the group. The shape of this distribution of differences 
simply corresponds to a value judgment on the relative weights 
which should be given to individual preference judgments to 
merge them into a social utility. 
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VIII. IMPLICATIONS FOR EXPERIMENT 

It is evident that as wide a variety of social utilities can be 
formally constructed as there are varieties of value judgments 
on the weighting of individuals themselves and the weighting of 
the individual preference judgments. 

A problem of interest to the social scientist is to match the 
live mechanisms of particular groups with one or more of the 
various formal mechanisms in order to make explicit the value 
judgments that characterize certain operational procedures that 
may be used to arrive at a social utility. 

For example, it would be instructive to analyze the implicit 
value judgments underlying determination of a social utility by 
a jury, in which unanimity must be obtained and there is open 
discussion. This could be studied in contrast to those obtaining 
for a legislative group, with discussion, but not requiring una- 
nimity on the social utility. 

It would be interesting to know how the way political leaders 
decide on order of importance of items in the party platform 
differed from the mechanism that characterizes the jury. Other 
variants, and experimental problems, are numerous and obvious. 



FOOTNOTES 

1. As the quantity |Phij| increases, the individual dislikes the 
stimulus more 

2. Cf. Chapter VII of [6]. 

3. Personal communication. 
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CHAPTER VII 

ALTERNATIVE CALCULI OF 
SUBJECTIVE PROBABILITIES 

by 

Stefan Vail 
UNIVERSITY OF MICHIGAN 

1. THE NEED FOR AN INVENTORY 
OF ALTERNATIVE CALCULI 

The traditional theory of choice states that an individual's 
behavior under uncertainty may be explained on the basis of two 
intervening magnitudes, utility and subjective probability, and 
that an exact analogue of the mathematical theory of probability 
can handle the problems of behavior under uncertainty. 

If we think that the traditional calculus oversimplifies reality 
and provides a spurious norm, we ought to review other existing 
calculi and other existing norms or invent new, richer but work- 
able concepts with which to axiom at ize theories of choice under 
uncertainty. 

An inventory of alternative calculi will serve to locate the 
exact area of a given experimental or theoretical contribution; 
it may itself suggest further theories and further experiments; 
and it may make some contribution to the metaphysics of proba- 
bility. 

The concepts that we are seeking are not merely "additional 
variables/' "influencing" utility and subjective probability. They 
may have to be entirely different constructs, exhibiting the logi- 
cal properties neither of utility nor of subjective probability. 

2. THE HORIZON OF CHOICE 

This section is a digression. Its purpose is to set apart the 
question of the horizon of choice, which is usually bound up with 
many theories of choice under uncertainty. 1 

When a situation involves numerous alternative choices or 
numerous outcomes, we might find it fruitful to use a concept of 
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"summarization" or "polarization" into a more compact and 
graspable set, perhaps I would venture typically into a dilem- 
ma. Dilemmas arise in fact all too frequently out of logically 
more complex situations, (for instance in multi-party politics, 
3-and-more-person games, arguments, attitudes) so that their 
genesis deserves special attention. This polarization might be 
characterized by two seemingly antithetical axioms: first, that 
the derived set is highly dependent on the composition of the 
parent set and, second, that once the derived set has crystallized, 
choice is independent of the alternatives blacked out. In addition 
we might have axioms to describe which of the candidate alterna- 
tives enter the derived set; for example: I) The derived set con- 
sists of the extreme elements of the parent set; II) The derived 
set consists of two alternatives representing the "average value" 
of the set of favorable alternatives and the "average value" of the 
set of unfavorable alternatives. Either axiom would explain the 
tendency toward the dilemma. 

When to each alternative choice is associated a set of pos- 
sible outcomes say in the form of an interval the ordering of 
the choices presents additional problems. We will need a device, 
such as an indifference map, which permits a complete ordering 
of the alternatives. An indifference map with metric relations 
between indifference-sets is equivalent to a utility function. 



3. AN INVENTORY OF CALCULI 

A calculus of uncertainty is a set of rules governing a proc- 
ess, whether conscious or unconscious, that transforms real or 
imagined uncertainty into decisions. Under the term "decisions" 
are subsumed such diverse activities as sorcery, gambling, and 
the laying down of the principles of a mathematical theory of 
probability. 

We may first divide uncertainty calculi into those involving 
'Vnechanical choice" versus those that embody "genuine decisions." 

3.1. Mechanical choice. Mechanical choice is a very wide- 
spread phenomenon, because it is economical of time and free of 
the agony of indecision. Some of it is conscious, as, for instance, 
the reliance on rules of thumb, magic or sorcery. Unconscious 
mechanical choice is exemplified by carry-on-as-usual activity 
and by behavior in a learning situation. All uncertainty calculi 
tend in the course of an individual's life to be replaced by some 
mechanical process; the crowning example of such a mechanical 
process is an elaborate theory of mathematical probability and 
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the theories of statistical decision which it supports. 

We will leave behind sorcery and the use of rules of thumb, 
merely noting that they seem to be quite widespread even among 
literate, sophisticated people, even when the stakes are impor- 
tant, and even when other calculi (such as subjective or objective 
odds) are available. We possess no predictive theories for this 
sort of behavior. 

Of learning we have several well-developed theories, usually 
cast in stochastic terms, with the two great virtues of being op- 
erational and predictive. These theories of learning are weak in 
two respects: first, they cannot explain the differential effects 
of different outcomes unless we superimpose on the learning 
theory a utility theory; second, learning theories are self- 
contained, that is, they do not predict when and how a given in- 
dividual ceases to act according to the learning pattern, say, 
upon having an insight: e.g. if he thinks the stochastic process 
to which he is exposed is random, if he looks for a pattern, or if 
he suspects foul play. 

The factors likely to upset the stable process of mechanical 
choice are: a new context, a new problem, a new insight, or a 
high degree of realism of the decision situation and the relentless 
approach of a deadline for action. 

We possess no detailed theory (except what can be borrowed 
from the theories of the ''threshold"), concerning the manner in 
which a mechanically choosing individual is forced to make 
genuine decisions. He is presumed to act "as usual" unless 
something "important" has happened, is perceived, and demands 
a new decision. 

Theorists and experimenters recognize that behavior varies 
considerably with the degree of realism of the choice situation, 
and we possess comparative sets of observations to substantiate 
this guess. If we possessed a measure of this effect, in terms 
of, say, the frequency distributions of a certain set of responses 
to a given choice situation we could, of course, use this informa- 
tion to refine or interpret other data. A more interesting project, 
however, would be to determine to what extent people sincerely 
trying are in effect unable to make genuine decisions when they 
don't have to; whether they shift into different sorts of subjective 
probability calculi when they play for keeps, or for chips, in im- 
agination only, or in other values of the degree of realism. 

Formally akin to the degree of realism is the question of the 
deadline or of the revocability of a decision. If t is the interval 
of time between the present and the deadline we might expect a 
priori results analogous to the effects of the degree of realism 
of the previous paragraph. Here, however, we would have more 
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possibilities than in the former case, because nearly every de- 
cision situation has to undergo a sequence of phases with respect 
to the deadline. If there is, as time passes, a tendency to re- 
evaluate the choice, the pattern of the swerving itself will often 
influence the final choice. How often did our deadlines not catch 
us unawares in a last minute antithesis to which we had not 
"really" meant to commit ourselves ? 

3.2. Genuine decision. (A) Subjective probability functions. 
The various concepts to be defined, e.g. "degree of belief," 
"psychological probability" or "perception of frequency" all 
refer to contemplated events, more precisely, to propositions 
concerning contemplated events. 

A point to be made at the outset is that these events do not 
have to be contemplated as occurring in the future. Thus, our 
area of study will include not only the case of a gambler betting 
on a horse to run tomorrow, but also the case of a student who 
answers "yes" or "no" to the question whether "Moscow is to 
the east of Constantinople," and also the case of a scholar who 
decides to accept that "the writer of Shakespeare's plays and 
Francis Bacon were the same person." 

(a) Existence of a belief function. An individual is usually as- 
sumed capable of vaguely ordering outcomes as "more likely" 
and "less likely." And we will have to make different further 
assumptions for the individuals who ascribe to nature passivity, 
constancy, and randomness and for those who animate nature 
into a friend or into an adversary. The individual who considers 
nature passive (likelihoods unaffected by his own aspirations or 
actions) is not necessarily better informed or more rational than 
the superstitious kind, for the former may, for example, fall 
prey to the fallacy that a run of heads must be restituted by tails 
in the next toss, or he may magnify (or simply like better) cer- 
tain odds. The individual who sees nature as an ally or as an 
adversary, though possibly tragically misinformed, may be able 
to make choices that appear perfectly rational when we view his 
situation as he does as a game of strategy. 

(b) Determinateness of the belief functions. Whatever denomi- 
nations people give to odds, "none," "slim," "questionable," 
"even," "fair," "good," "excellent," and so forth, are usually 
postulated as simply ordered and as invariant with respect to 

the constitution of the set of events to which they are ascribed. 
However much haziness veils the boundaries of that coarse 
grained classification, we ordinarily expect asymmetry, and 
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transitivity to be satisfied. In a more general postulation we 
should expect them to be satisfied in probabilistic terms, 2 with 
the requirements becoming less and less rigid as the number of 
alternatives increases. 

In addition to the ordering postulates it might prove useful on 
occasion to postulate concerning "metric relations" that when- 
ever we are given an n-fold classification of odds, it is easier 
to distinguish between classes removed from each other by r 
places if they are astride the equal-odds class, than if they are 
both on the same side of the equal-odds class. Such a "bunching" 
could be used to derive the tendency toward the dilemma, noted 
several paragraphs ago. 

(c) The nomenclature of the belief function. We should not, 
in general, expect the terminology to mean the same things and 
to retain its simply ordered properties over heterogeneous 
events which are unrelated to each other in ordinary decision 
situations, and the same can be said, in particular, about the 
relation that these denominations bear to numerical odds. 



3.3. Genuine decision. 



(B) Probabilities subjective, ob- 
The accompanying table is a sche - 



jective and mathematical. 

matic representation of the relations between the concepts that 

this section discusses. 3 
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The rows bear the labels: mathematical, psychological, objec- 
tive; the columns refer to the number of occurrences of an event. 
The cells are named A through J. Entry J is outside the row- 
column scheme of classification. 

I assume that we understand what is meant by events similar 
in relevant respects; this notion, or the notion of a set, or some- 
thing equivalent is indispensable to the classical approach to the 
mathematical theory of probability. On this basis, suitable defi- 
nitions of a "measure of a set," "ratio of measures," etc. allow 
us to fill cells B and in the table. Let us call this entity n 
or "mathematical probability" or "theoretical probability." 

If we want to make entries in A we will discover that to be 
satisfactory, they have to be defined in terms of the entry /i: we 
seek "relevant characteristics" of the single-occurrence event 
in question, which will enable us to assign this event to one or 
more classes of the kind that underlie fi. 

Let us skip for a while the second row of cells and enter an I 
in cell H. This f stands for the objective relative frequency of 
an empirical event; it presupposes knowledge whether the event 
did in fact happen, a suitable operative definition of time inter- 
vals and an objective method for counting occurrences and non- 
occurrences. It is clear that we cannot fill in cells G and I. 

For the second row of cells one can think of several a priori 
distinct conceptual entities: 

a. The psychological pattern which dictates that we postulate 
such-and-such properties for n ; the thing which makes us con- 
fident that some particular set of axioms for a theory of mathe- 
matical probability provide the best model of the world around 
us; the process which enables even a child to state with assured- 
ness that for a fair coin // (heads) = fi (tails). I call this proc- 
ess a "pro-theory concerning fA ." 

b. Sometimes the faculty for discerning proper values for H, 
("proper" in accordance with the principles provided by the pro- 
theory) is dimmed by haste, by inability to perform complicated 
computations, or by forgetting essential data of the problem. 
For example, it may be, for some people, very hard to say whe- 
ther fi (either a spade, or a king, or a queen, or a jack in a 
single drawing for a pack of 52 cards) is greater or less than, 
or equal to // (a black card in a single drawing from a similar 
pack). 

Let us denote these estimates of u , f ^// f and call them 
"perception of mathematical probability // ." In general 
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will be not only a function of n, but a function of the particular 
event E, whose H is under consideration; it will be a function 
of the wording of the problem, of time allotted to its solutions, 
and so forth. Part of my experimenting (reported in the appen- 
dix) relates to this function ty. 

c. Given an event with objective frequency f, we will call tyi 
the perception of this frequency, or the impression we obtain by 
being exposed to the occurrence of a finite repetitive event. Ex- 
periments by Lawrence, Estes, and Bush and Mosteller deal with 
this entity. 

d. Now it^may happen that an individual has certain confidence 
in the occurrence of an event. This confidence may or may not 
have arisen from a conscious attempt to calculate the fi (if any) 
of the event, or to observe the f (if any) of the event. Let the 
letter /3 stand for this degree of belief, and, let us mark in our 
table 0i, n , 0^ for we have a priori no way of knowing whether 
the same laws apply to all f s regardless of the index. It seems 
that interesting problems in economics (such as business invest- 
ment decisions, occupational choice, large household expendi- 
tures) fall under 0j or n with n a rather small integer. 

A scheme such as the above is too transparently formal to be 
readily acceptable. We must convince ourselves of its useful- 
ness. 

It may be objected to the column-wise classification that the 
classes are not correlative: Column 1 is of a different sort than 
the other columns because statements concerning truly "unique' 1 
events are not statements about probability. This is precisely 
why no symbol associated with our ordinary mathematical theory 
of probability has been entered in Column 1. Truly unique events 
cannot be reduced to intersections of sets of repetitive events. It 
is certain that Column 1 differs very radically from the other 
two in terms of our ordinary mathematical theory of probability; 
but from the point of view of the experience of unsophisticated 
people it is Column 3 that differs most radically in kind, simply 
because it concerns something ideal and beyond experience. The 
column-wise classification was meant as a reference device, 
until we have empirical reasons to believe one or both of these 
distinctions superfluous. 

We may ask, is not the comparison between statements belong- 
ing to Column 1 and those belonging to Column 3 somewhat ir- 
relevant? Coin-tossing, even if it happens only once, still be- 
longs to Column 3; whereas guessing what the capital of Iraq is 
belongs to Column 1. Column 1 contains statements about sets 
containing a unique element for which probability is inapplicable 
almost by definition. 
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Strictly speaking, the tossing of this penny, now, by me, etc., 
is a unique event, a set consisting of one element. Strictly 
speaking, it belongs to Column 1, and if you maintain that it 
really belongs to Column 3, you do so because you happen to 
think that this, now, and me are irrelevant "accidents"; your 
theory of probability tells you to abstract therefrom. Now, it 
would be interesting tQ find whether certain people do in fact 
think of all instances of coin-tossing as so many one-element 
sets while other kinds of people think in more generic terms, 
pooling all such instances into an infinite set belonging to Col- 
umn 3. 

Similarly the distinction between $H and 0f, aside from that 
suggested by purely verbal analogy from the distinction between 
H and f themselves can in certain cases prove of real behavioral 
significance. Because a person equipped with a pro-theory or 
with a theory concerning \JL might or might not be able to obtain 
(by computation, or by any other process allowable by his pro- 
theory or theory) values for // "correct" or "distorted" with 
respect to criteria defined by his theory. Now, quite independ- 
ently of his ability on this score, his perception may or may not 
distort the objectively measurable f f s. 

Whether as a rule people distinguish between the i/7/ and 0f 
in practice so as to justify a hair-splitting task of classification 
should be settled empirically. However, my impression is that 
many people are vaguely aware that observable chance events 
approximate the "true mathematical odds" perhaps only "in the 
long run"; some people are not at all surprised to observe "runs" 
of luck (significant fluctuations in f as a function of the number of 
trials) and yet their confidence in the appropriateness of their 
favorite probability calculus remains unshaken. For such people 
we may say that they do distinguish frequency from probability, 
and it would be reasonable to expect them to exhibit correspond- 
ing differences in behavior. 



3.4. Choice on the basis of /*, 0/K, /3, f or 0f. Now that we 
possess a large assortment of indicators of uncertainty we must 
ask the further question, how are these indicators transformed 
into choice? 

First we must acknowledge the possibility that choice is in- 
dependent of the indicators 1 "intrinsic" value. To the extent that 
people simply like certain odds better than others and choose 
actions accordingly, they fail to utilize the "intrinsic" (say the 
order, the metric, the numerical) uncertainty properties of the 
indicator. If, as Ward Edwards 4 has reported, the phenomenon 
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of pure preference for some odds is widespread, it will over- 
whelm the subtler effects of many true calculi. 

There exist numerous true calculi of indicators of uncertainty. 
Alongside the indicator of uncertainty they carry a second indi- 
cator, of goodness, (utility). All such theories contain an addi- 
tional device, the "indifference map" which establishes "exchange 
ratios" between utilities and probabilities.^ Choice maximizes 
the "psychological product" of these indicators. 

These theories (with the exception of Shackle's, op. cit.) con- 
tain "Boolean postulates" that permit easy reductions of the 
probability of complex (alternative, conjunctive) events into 
simple probabilities, and permit a direct application of the laws 
of mathematical probability to subjective probabilities. 

With these comments I shall leave aside the discussion of the 
traditional two-parameter calculi. They are developed into well 
known axiomatic structures rich in theorems and in normative 
implications. Let us return to our classificatory table. What 
about the column- wise distinctions? 

Does it make any appreciable difference in behavior whether 
the event contemplated is believed to belong to the n = 1, n = 2, 
n = 3, etc. class? 

I think the answer to this is an unqualified yes. The distinc- 
tion between values of n helps make sense out of "nonsensical" 
behavior in most experimental games. Almost all critics of the 
Mosteller-Nogee application of the von Neumann approach to 
the measurement of utility have remarked that the choices made 
by the subjects between the odds offered by the experimenters 
would have been different if the experimenters had warned the 
subjects that their choice would actually be enforced once, 
twice, , n times. It would be interesting to measure how 
fast the inconsistency (upon replication) in the choice "odds A 
vs. odds B" decreases as the number of contemplated plays in- 
creases (provided the odds A and B are not mathematically 
equivalent). We may postulate that for sufficiently large n the 
(objective) probability that a certain individual will adopt a pure 
strategy will exceed a given amount. Believers in reincarnation 
should be expected to approximate pure strategies for quite low 
values of n, because such people expect to play the same game 
a large, or perhaps an infinite, number of times and hence ex- 
pect their long-run luck to approximate the true odds. The con- 
vergence to pure strategy will, of course, be less rapid for be- 
lievers in reincarnation with strong time preference. 
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4, DIAGNOSIS AND MEASUREMENT 

Empirical research in the field of uncertainty has to solve 
three distinct problems: 

(1) Diagnosis how to devise crucial tests that will tell us 
which calculus is used by a given subject in a given circum- 
stance. Without ruling out introspection and interviews, we 
should prefer tests based on the logical properties of the calculi. 

(2) Identification whether it is possible to separate a belief 
function from measurements of a total situation that involves 
other psychological unknowns (say, a utility function). 

(3) Measurementhaving assumed a particular calculus, or 
having diagnosed it and certified its identif lability, how to obtain 
estimates and confidence limits for its parameters, and how to 
test its axioms or theorems. The pilot experiment reported in 
the appendix describes one possible procedure for obtaining a 
measure of the perception of mathematical probability. 



APPENDIX 

The pilot experiments summarized^ in this report attempt to 
throw some light on the following problems: (1) the existence of 
a probability calculus; (2) whether such a calculus has the nature 
of a psychological probability function; (3) the shape of this func- 
tion; (4) factors which cause behavior to deviate from what such 
a function implies. 

Experimental design. The subject was supposed to perceive 
with or without distortion mathematical probabilities A*i> given 
implicitly in terms of propositions Ei. Depending on whether Ej 
occurred, or failed to occur, one or the other of two sides of a 
bet would win. For each bet (about sixty for each value of i) the 
subject said which side he preferred to take. As the stakes and/or 
prizes varied for the same bet, the subject might change his side 
on the bet. Some of these bets were straight bets (x cents to y 
cents), while others involved gains only, and still others losses 
only. If the subject is assumed to maximize his (tentatively as- 
sumed linear) utility for money, weighted by his perception of 
the mathematical probabilities of the E^ the levels over the 
stake-prize field at which he reverses his choice determine a 
real number i^n i (0 < ^ <1) for each i. This is identified as 
the "perception of the mathematical probability pi." 
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Subjects. Four Santa Monica High School students were the 
subjects. They were paid for their time at a flat rate, plus any- 
thing they won from the experimental bets. 

Conclusions. The following conclusions are to be considered 
firm for their limited sample in spite of certain inadequacies in 
the execution of the pilot experiment: 

1. The utility function can without violence to observation be as- 
sumed as linear for the prize ranges investigated Qtf to +13</. 

2. An ordinary bet, gambling without possibility of loss, and 
gambling without possibility of gain give rise to qualitatively 
different responses and contradict the hypothesis that choice 
is based on a simple psychological "product" of utility and 
the perception of probability. 

(a) If the choice is between evils, the lesser loss is chosen 
regardless of the relative odds of the two evils. 

(b) If the choice is between two positive prizes, the surer 
prize is chosen regardless of the magnitudes of the two 
prizes. 

3. A function ^ can be derived for straight bets with a high 
level of significance for each subject. 

4. The functions thus derived agree in three out of four cases 
with independently derived metrics over the continuum JLL, 
and with independent tests in which the subjects rank-ordered 
the events EI according to their probabilities. 

5. All subjects underestimate probability 1/6, two overestimate 
probability 1/4, and three overestimate very markedly prob- 
abilities exceeding 1/4. 



FOOTNOTES 

1. Examples: (a) the postulates restricting the commodity field 
over which utilities at any one time are defined, N. Georgescu- 
Roegen: "The Theory of Choice and the Constancy of Economic 
Laws," in Quarterly Journal of Economics, LXIV (Feb. 1950); 
(b) the gambler-indifference map, G. L. S. Shackle, Expectation 
in Economics, Cambridge, 1952; (c) the minimax principle in 
games and in statistical decision functions. All these are special 
kinds of functions mapping what I call the parent set onto the 
derived set of alternative courses of action. They have nothing 
to do with uncertainty. 
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2. See, e.g., Karl Menger's "Probabilistic Theories of Relations' 1 , 
Proc. Nat. Acad, Sci U.S.A., 37 (1951), 178-180. 

3. This scheme of classification has its origins in C. H. Coombs, 
"The Measurement of Psychological Probabilities and Their In- 
teractions with Utility for Consequences and for Risk", Working 
Memo 2, Michigan Project, July 9, 1952 

4. "Probability Preferences in Gambling Behavior", Amer, J. 
Psycholo, 66 (1953), 349-364. 

5. It is not usually realized that this conception leads to an ob- 
jectionable logical regress: for in addition to the utility which, 
along with probability, defines the field of indifference, we must 
have second-order super-utilities, more "ultimate" than the 
first, corresponding to the contours of the indifference map. 

6. Full discussion of these experiments will be found in Working 
Memos 15, 27, 33 and 35 (hectographed) of the Seminar on the 
Design of Experiments on Decision-Making Processes, Santa 
Monica, 1952. 



CHAPTER VIII 

A FORMAL STRUCTURE FOR 
MULTIPLE-CHOICE SITUATIONS 

by 

Robert R. Bush,* Frederick Mosteller, * Gerald L. Thompson* 
HARVARD UNIVERSITY, DARTMOUTH COLLEGE 

1. INTRODUCTION 

A large class of problems in experimental psychology in- 
volves choice situations. An animal or person is periodically 
presented with a choice of alternative courses of action, e.g., a 
rat can turn right or left in a maze, or a person chooses one of 
several answers on a questionnaire as the "correct" one. The 
possible choices are ordinarily well defined in an experiment. 
In many such experiments on conditioning and learning the 
same choice situation is presented to the subject repeatedly for 
many trials and the psychologist is interested in the sequence of 
choices actually made by the subject. 

When one looks at the data from learning experiments in- 
volving choice situations, systematic changes in the behavior of 
the subject are usually observed. These changes take place, one 
assumes, as a result of the treatment used by the experimenter. 
For example, the experimenter may reward one response and 
punish another. The rewards and punishments may assume 
various forms in different experiments a pellet of dog chow 
may be rewarding to a hungry rat, but a nod of approval may be 
equally effective for an M ego-involved M human subject. The 
events which alter behavior may be the mere execution of a re- 
sponse, i.e., a "practice effect" may be present in some experi- 
ments; there may be a tendency to be consistent. In a rather 
general sense, then, what characterizes the experiments we are 



*The work of the first two authors named was partly supported 
by the Laboratory of Social Relations, Harvard University; the 
work of the third was done in part at Princeton University with 
support of the Office of Naval Research. Seniority of authorship 
is not implied by the order of the names. 
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discussing is a set of possible courses of action and certain 
events which alter the subject's tendency to choose those 
courses of action. In the mathematical system we are about 
to present, these two notions will play a central role. 

The work to be described below is an outgrowth of the work 
by Bush and Mosteller on mathematical models for learning 
[1,2,3] and related work by Estes [4], Miller and McGill [6] and 
Flood [5]. In this paper no attempt will be made to apply the 
mathematical system to experimental data. Instead, we have 
chosen to present the mathematical structure in as general 
terms as we consider feasible and leave the applications to 
future papers. The reader is referred to the literature for a 
few applications of special cases [3,4,6]. 



2. THE ELEMENTS OF THE SYSTEM 

As primitives in the mathematical system we choose a set ou 
of alternatives and a set o'of outcomes. The set ^ will contain 
r mutually exclusive and exhaustive alternatives Aj, and these 
alternatives will correspond to classes of responses of real or- 
ganisms in an experimental situation. In an experimental study 
of the behavior of rats in a simple T-maze, for example, we 
would distinguish two alternatives: turning right (Aj) and turn- 
ing left (A 2 ). We assume that the set a/ is presented to the or- 
ganism on each of a series of trials and that on each trial the 
organism chooses or performs one of the Aj. Whenever an al- 
ternative Aj occurs it will have some outcome O^ and Or is the 
set of the s possible outcomes. In general, an outcome O^ will 
correspond to some change in the organism's environment, a 
stimulus change, a reward or punishment, but we will include 
no change as a possible outcome. In the T-maze experiment 
mentioned above we would distinguish two outcomes: finding 
food (Oi) and not finding food (02). 

In terms of our two primitives, alternatives and outcomes, 
we will define what we shall call events. Corresponding to each 
alternative-outcome pair, (Aj,Ofc) we will have an event Ej^. We 
denote the set of the Ej^ by . A more general formulation 
would include events EI which are not identified with alternative- 
outcome pairs, but we will not consider such a generalization 
here. We further define a set IT of conditional probabilities TTJ^. 
If alternative Aj occurs on trial n, then TTJ^ is the (conditional) 
probability that outcome Ofc occurs and hence that event Ejfc 
occurs. We will assume that these TTJ^ are constant; they will 
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correspond to probabilities which the experimenter controls. 
Since some outcome must follow each alternative, 

(1) E *Jk - 1 

k=l 



It is convenient to regard all O^ as possible outcomes of every 
alternative Ajj if an O^ cannot follow Aj, then TTJ^ = for such 
impossible events. The above described concepts characterize, 
for our purposes, the environment of an organism, i.e., repre- 
sent an abstract experimental situation. In fact we shall call 
[fa, 6-,7r] a representation of the experimental situation. 

We now need to ascribe certain properties to the organism. 
For any trial n (n = 0,1,- ) we let the organism possess a 
probability vector p(n) whose components are the probabilities 
of occurrence of the alternatives Aj on trial n. Some event Ejk 
will occur on trial n and this event will change the probability 
vector to p(n+l). For each event Ejk we define an operator 
such that if Ejk occurs on trial n, 



(2) p(n+l) = T jk p(n) . 



We then assume that the operators Tjk are linear and so we 
represent them by rxr stochastic matrices. For our purposes 
a stochastic matrix has non-negative elements, and columns 
which separately sum to unity. (Usually it is the rows that sum 
to unity, but we are using a different convention.) The vectors 
p(n) will be written as column vectors and so the Tjk operate 
upon the p(n) from the left. We next place some further re- 
strictions on the operators 



3, THE COMBINING OF CLASSES 

Because the identification between real classes of behavior 
and alternatives in the model is, to a certain extent, arbitrary, 
we should like to have the behavior of the organism be invariant 
under changes in these identifications. In this section we shall 
formulate the ''combining classes" condition and then give some 
interpretations. When this condition is imposed, it will be pos- 
sible to combine alternatives that have the same set of outcome 
probabilities without altering the basic features of the system. 
For example, if rats are being conditioned to bar- pressing, one 
might define two classes of bar-pressing; presses from the 
right and presses from the left. If these two classes were to be 
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"treated" the same way, i.e., were to have the same conse- 
quences for the rat, then we would like to be able to combine 
these classes on any trial without changing our behavioral pre- 
dictions. We will now make this idea precise. 

We shall consider two representations of the same abstract 
experiment and shall denote these by [<^, ,TT] and [<z*, *>*]. 

Let & = A , A. be a subset of <^ 



Define a composite alternative A o to be chosen if and only if 
one of the alternatives Aj in CL^ is chosen. 

Let #*= |A } U (&- # ), i.e., the set consisting of all alter- 
natives Aj which are in cz but not in <^ o together with the al- 
ternative AQ. Since the alternatives Aj are mutually exclusive 

the probability of A o being chosen at the nth trial is 
(3) Pr |A } = p^di) + + pj h (n) = p (n) . 



We should like to define the composite event E O ^ = (A< 
to occur if and only if one of the events Ej k = ( A j >Ofc) occurs, 

but this leads to a further consideration. Note that the uncondi- 
tional probability that one of the Ej k occurs at trial n is 
h y 

i p jy( n ) ^jyk In order that A could be treated as an or- 
dinary alternative, this expression would have to be equal to 
p o (n) times the probability that Ok occurs following the choice 
of alternative A o . But the latter probability depends on which 
alternative Aj has been chosen given that A o is chosen. Hence 
the concept "the probability of Ok given A o " is well defined 
only if TTJ k is a constant for all y and for each k. Let 6* = 

{ (A o , Oj), , (A0, O s )} U { (Aj, Ok) | Aj e (d - &Q)\ that is, 

each of the composite events E o k together with the events which 
are not composite events. We shall say that [U, ,TT] and 
[#,*, 6*, TT*] are equivalent representations of the same abstract 
experiment if and only if nj^k = ^ o k for y = 1,- , h and each 
value of k, that is, if and only if the alternatives Aj receive 
the same treatment. 

In most experimental arrangements of interest to us the con- 
ditional probabilities ?rjk are at the disposal of the experimenter. 
Therefore, we assume that any two representations of an ab- 
stract experiment can be made equivalent by an appropriate 
choice of the Tnt. 
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In the representation [&, ,TT] of the experiment there are r 
alternatives and in the representation [<* , *,?r*] of the experi- 
ment there are r - h + 1 alternatives. Consider, for example, 
the case in which the first (r-1) alternatives are combined, i.e., 
CL O = { A},- -,A r _i}, CL*= |A Q , A r | . In this case the proba- 
bility vector 



(4) 



/"A /PO\ 

I I I is replaced by the vector I J. 
\Pr/ \Pr/ 



This change can be thought of as a linear transformation of the 
r-dimensional probability vectors (which form an (r-1) dimen- 
sional simplex Z r .j, the space over which the probability vec- 
tor can vary after the restrictions Zpj = 1 and O p j are 
imposed) onto a one-dimensional sub-simplex Ej of itself. 
This is a linear transformation which sends 





(5) [ I j onto the vector 



where we have arbitrarily placed p o in the first row (it could 
have been placed in any one of the first r-1 rows). The only 
difference between (4) and (5) is the addition of r-2 rows of 
zeros in the second vector of (5) and the correspondence (iso- 
morphism) between the two is obvious. For this example, the 
matrix of the singular linear transformation (5) is given by 



I 1 1 O 



/ 


00 


\ 


...00 


\0 


..oi 



(6) C = 



The mathematical term for this kind of a linear transformation 
is projection. 

To each pair of equivalent representations of an abstract ex- 
periment there is a projection matrix C similar to the one shown 
in (6). For each operator T^ define the projected operator 
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T*jk = C Tjfc, and for each probability vector p define the pro- 
jected probability vector p* = C p. The star operation is 
equivalent to left multiplication by a projection matrix C. 

We shall say that the operators Tj^ (abbreviated T in the 
following) satisfy the combination of classes requirement if and 
only if, for each pair \tu> ,11] and [CL*, *,TT*] of equivalent 
representations of the experiment with associated projection 
matrix C, each operator T satisfies 

(7) T*p* = (Tp)* that is, (CT)C p = C(T p) , 

for all p in 2 r _i. Since matrix multiplication is associative 
this condition will hold for all p if and only if 

(8) CTC = CT. 

We wish to have the operators Tj^ independent of the ?rjk and 
so we are at liberty to choose the TTJ^ so that any two repre- 
sentations of an experiment are equivalent. Hence we require 
that equation (8) be satisfied by all projection operators C. 
The foregoing arguments are summarized in the 

Definition: A stochastic matrix operator T satisfies the 
combination of classes requirement if and only if equation (8) 
is satisfied for all projection matrices C. 

To see what restriction this places on the operators, let 

be the matrix of an operator T, and consider the projec- 
tion matrix C in equation (6). Then computing the right and 
left sides of (8) we have: 

r-1 



CT = 
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CTC = 





These two matrices will be equal if and only if t r( $ = t r j for 
6 = 1, , r-1. When t r $ = t r j for 6 = 1,2, , r-1, the last 
rows are identical at once, and the first rows are identical be- 
cause each column must sum to unity. If we take the other sub- 
sets of r-1 alternatives and combine them we will obtain pro- 
jection matrices similar to the one in (6) and applying them to 
equation (8) we find that in order that any subset of r-1 alter- 
natives can be combined it is necessary that the ty^ should be 
equal for all 6 4 V, i.e., 

(9) t y = t y for y ^ (J, y,<J = 1,2,- , r . 

Conversely if (9) is satisfied then any subset or r-1 or fewer 
alternatives can be combined in the desired way. 

These results imply that when the combining classes condi- 
tion is satisfied we can write the matrix of any operator T as 
follows: 

2-^ t\/ ti o . . 



- E t 



\ 



tr t, 



(1- t y )I 
y=l 



V 



^ t r . 



1- E 



'A 
t2 



tr/ 
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where I is the r x r identity matrix. Setting a = 1 - ty 

and Xy = ty/(l-cr), for a ^ 1, we have 

/ Xl 

T = a 1 + 

\ 
Xj. Xj. o . . Xj. ' 



(Xj Xj ... Xj \ 
o 1 
X A- X~/ 



r 

Note that X y = 1 and that X y and therefore X is itself 
y=l 

a probability vector. Let A be the r x r matrix, each of whose 
columns is the vector X. This leads to 

Theorem 1, If a stochastic matrix operator T satisfies the 
combination of classes requirement, as given by equation (7) 
above, then T can be written in the form 

(10) T = a I + (l-a)A . 

From this theorem we immediately obtain the following three 
corollaries: 

Corollary 1 If a < 1, then X is the unique fixed point of 
both A and of T, while if a = 1 then every probability vector 
is a fixed point of T. 

The first part of this corollary follows from the fact that for 
any probability vector p we have A p = X, because the compo- 
nents of p sum to unity and all the entries in a row of A are 
identical. Moreover, if a < 1 then T p = p if and only if p = X, 
When a = 1 then from equation (10) above we see that T re- 
duces to the identity operator I. 

Corollary 2. When an operator T satisfies equation (10) 
above then the nth iterate of T is given by 

(11) T n = <*ni + (l-an) A 

and when in addition | a \ < 1 then p(n) = T n p(o) converges (in 
the topology of r-l) to X. 

Equation (11) follows from the fact that A n p = X for 
n = 1,2, which is a consequence of Ap = X and AX = X . 
The convergence for I a I < 1 is immediate from the definition 
of p(n) and equation (11). 
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Corollary 3. Two operators TI and T which satisfy 
TI = 0^1 + (l-Oj)Ai have a commutator TjT2 - T2Tj given by 

(12) 



and TI and T 2 commute if and only if a i = 1 or a 2 = 1 or Tj 
and T 2 have the same fixed point. 

It is readily shown that AI A 2 = Aj and A 2 AJ = A 2> and then 
a simple computation gives equation (12). The two operators 
commute if and only if their commutator is the zero operator 
and the three conditions which cause this to be so may be seen 
immediately from equation (12); when AI = A 2 then \i = A 2 . 

In order that the operator T in equation (10) should have 
non-negative elements the parameters a and A must satisfy 
the inequalities < a + (l-a)Ai 1 and (l-a)Ai < 1 for 
all i. Since the AJ are non-negative, the above inequalities re- 
quire that a satisfy 



(13) max I T-^J ) ^ a <, 1 . 




[if some Ay = 1, then A y /(Ay-l) is to be interpreted as - <x>.] 

If p is any probability vector and T is a matrix of the form 
(10), then Tp = a p + (l-a)A. If a is non-negative then p is 
sent by T to a point Tp on the line segment between p and A 
It can be shown that the only linear operators which possess this 
interpolation property are of the form (10). We shall insist upon 
this property for all events Ej^, Ue , for all j and k, 

(14) Tj k = a jk I + (l-crjk)Ajk . 

Another interpretation of the combining of classes restriction 
is the following. If p is any probability vector and C is any 
projection operator of the form (6) we have the following dia- 
gram: 



[Tp] x = T" p*. 



Hence if p(0), p(l), p(2), is any sequence in the representa- 
tion [fl/, t,ir] and p(0)*, p(l)* , p(2)*, is any sequence in the 
representation [&*, ?,TT*] then we have 
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P(0)- 



Hence if the combining of classes condition holds or if each of 
the operators is as in (14) then any sequence in the representa- 
tion [<* , *,TT*] is a projection of a sequence in the representa- 
tion 



4. TRAPPING THEOREMS 

Now that we have developed the general form of the event 
operators, given by equation (14), we will show that the proba- 
bility vector p(n) is contained within specified regions in the 
limit as n-*oo. Because a vector cannot be moved out of these 
regions by any of the Tjfc, we call the statements about the re- 
gions the "trapping theorems." In the first part of this section 
we shall consider the operators to be indexed by a single sub- 
script i; thus we have operators 

TI = o?i I + (1-04) Ai , 

where i = 1, t. For any subset X of r -i hereafter de- 
noted by r, let Ti(X) be the set of images of points of X after 
one application of the operator T[. The following lemma ex- 
hibits the properties of the operator TI. (For simplicity we 
drop the subscript i in the statement of the lemma.) 

Lemma. If T = a I + (l-a)A and a? ^ then T has the 
following properties: 

(a) T is one to one and continuous, 

(b) T sends closed (open) sets into closed (open) sets, 

(c) T sends hyperplanes onto parallel hyperplanes. 

If a as then T maps every vector onto the vector X. 

Proof, (a) If Tp = Tp 1 then ap + (l-a)X = ap f + (l-a)X so 
that a(p-p f ) a 0. Since a ^ we have p = p f . The continuity 
is obvious. 

(b) The inverse mapping to T is T"* = - I + (l--)A 

which is continuous and therefore T sends closed sets into 
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closed sets. A similar statement holds for open sets 

(c) Let h = (hi, . , h r ) by a row vector, x a (column) 
probability vector and d a real number. Then consider the 
hyperplane H = {x | hx = d}. We have, for any point x on H 

h(Tx) = a hx + (1-a) h A = ad + (1-a) h X = d f 

where d' is a real constant. Then T(H) is the hyperplane 
H' = {x | hx = d 1 } and so H 1 is parallel to H. 

If a = then T = A and Tp = Ap = \ for all p in Z . This 
completes the proof of the lemma. 

It follows as a corollary to this lemma that the image under 
Ti of any polyhedron is again a polyhedron. In particular Tj(S) 
is a simplex with faces parallel to the faces of Z. 

If o?i = 1 then Ti is the identity operator and Ti(Z) = Z; 
hence this case might be interpreted as no learning. If 04 = 
then, as seen above, Ti(Z) = \i, a point in Z, and so this case 
might be interpreted as one trial learning. If < cr i < 1 then 
each point p in Z is moved by Tj to an interior point on the 
line segment between p and \i, so that this might be considered 
uniform learning; the smaller a j, the more rapid the learning. 
If a i < then each point p in Z is moved by Ti to a point on 
the line through p and \\ which lies on the opposite side of \i 
from p. 

In Figure 1 we illustrate graphically the sets Z, T(Z) and 
T 2 (Z) for the case r = 3. In Figure l(a) the operator2 is Tp = 





(a) 



(b) 



Fig. 1. The simplex Z and its first two images, T(Z) 
and T 2 (Z), for the case of three alternatives (r = 3). In 
Fig. l(a) we have taken a = 1/2 and A = (1/3, 1/3, 1/3); 
in Fig. l(b), a = -1/2 and X = (1/3, 1/3, 1/3). 
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1/2 p + 1/2 (1/3, 1/3, 1/3), that is, a = 1/2 and X = (1/3, 1/3, 
1/3). In Figure l(b) the operator is Tp = -1/2 p + 3/2(1/3, 1/3, 
1/3), i.e. a = -1/2 and X = (1/3, 1/3, 1/3). 

Regardless of the choice of the initial vector p(0) in Z we 
can say that after the first step in the process 

(15) p(l) e (J Ti(2) = S^ (set-theoretical union). 



Sj is closed since by the lemma each Tj(Z) is closed and so Sj 
is the union of closed sets. 

A possibly weaker statement than this one is 



t 



(16) 



p(l) \J Ti(2) = Cj (convex union), 



where the symbol W T^(Z) means the smallest convex set 
i=l 

containing the union of the sets Tj(S). Since this union is 
closed, GI is closed. Figure 2(a) shows the set 8j and 
Figure 2(b) the set C^ for an example with r = t = 3 and two 
of the a i positive and one negative. 





(a) 



(b) 



Fig. 2. The set- theoretic union, Sj, and the convex union, 
GI, of the sets Ti(Z) for r = t = 3. Two of the oj are 
positive and one is negative. 
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The corresponding statements at the end of the nth trial are: 



(17) p(n) \J T^Sn.i) = S n 

t 

(18) p(n) c y Ti(C n .i) = C n . 

Sn and C n are closed for the same reasons that Sj and Cj 
were closed. The sets S n and C n provide bounds on the loca- 
tion of the probability vector on the nth trial. If the a i are 
small in magnitude these may be quite useful bounds. 

Since each | o?i | < 1, the sequences of sets which obtain, 
namely {S o , Sj, S 2 , } and |C O , Cj, C 2 , } where 
S o = C = Z, are monotone decreasing and have limits* Hence 
we define 

oo 

(19) S = lim ^ = fl ^ , 

n > oo n=o 



(20) C = lim C n = 

n -> n=o 

Note that both S and C are closed sets since each member of 
each sequence is closed. Since C n D S n for every n, CDS. 

We next define recursively two more sequences of sets { R n } 
and { B n } with which we shall be concerned. Let 

t 

(21) RQ = V Xi = B , 



t 
(22) Rn = U 



(23) B n = V T i< B n-l). 



Since Ti(Xi) = Xj the sequence { B n } is monotone increasing so 
that we can define 

(24) B = lim B n = (J B n . 



n "* w n=o 
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In this case B need not be closed. For example, let r = t = 2 
and consider the two operators, Tj and T2, where TIP = 

- ip + | (1/3, 2/3) and T 2 p = - ^p + |(2/3, 1/3). It is not 

hard to show that for these two operators B is the one- 
dimensional simplex of probability vectors except for the 
vectors (1,0) and (0,1), and is therefore an open set. 

The sequence {RQ, Rj, } is in general not monotonic. 
The following example^ shows that this sequence need not con- 
verge. Let r = t = 2 and consider the following two operators: 
TI(P) = (1,0) and T2(p) = -p + 2(1/2, 1/2). It is not hard to 
show that the limit inferior of the sequence {R n [ consists of 
three points and the limit superior of the whole simplex 2j. 
Hence we define 

oo oo 

(25) R = lim sup R n = P| U ^ 

n -" m=l n=m 

R need not be closed for the same reason that B was not 
necessarily closed. 

With this notation we can state the "weak" trapping theorem. 
A slightly stronger version of it will be given later (Theorem 4). 

Theorem 2. If |a?il< 1 for i = 1, , t then C = B (where 
B is the closure of B). When (*{ < 1 for all i, then C = 



Proof. If all 04 are zero then Cj = C 2 = = C = B o = 
BI = 62 = = B, and the theorem is trivially true. Hence we 

assume that at least one o?i is non-zero. Let a - max \a\\ ; 

i 

then < a < 1. We also assume that B o C (proper contain- 
ing) or there is nothing to prove, because if B o = 2, then B n = 
C n = S for n = 1, 2, . 

First we show that C 2 B. We know C o 2 GI 2 2 C n 
2 and B c B! c . . . c Bn c . . . . Since C o 2 B it 
follows by induction that C n 2 B n . Let m n; then C n 2 
c m 2 B m . Hence for every n, C n D B m for all m n. Fur- 
thermore C n 2 B m for 1 < n, and so C n 2 )jj B m = B for 
every n; therefore C = n C n 2 B, and since C is closed 
C 2 1. n= ;_ 

Next we show that C c B. We will do this by first showing 
that as n gets large, the extreme points of C n get arbitrarily 
close to B, and then using convexity. Since each C n is a convex 
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polygon it is spanned by a finite number of extreme points. Let 
V-l* * * "> ^r be * he extreme points of C o = E. Then since the 
operators TI are linear, Cj will be spanned by a subset of the 
points M(!) = Ti(jutj) for i = 1, , t and j = 1, , r . In gen- 
eral, C n will be spanned by a subset of the points /i(n) = Tj * 
Tjj ()ULj) for i n , ' , ii = 1, , t and j = 1, , r. 

Observe that 

(26) T ln Tij (2) : 

* 

Let 6 be the diameter of Z. (The diameter of a set X is the 

longest line segment that it contains; symbolically 

6 = g.l.b d(x,y), where d(x,y) is the distance between x and y.) 

x,yeX 
Then the diameter of Tj Tj. (2) is < d a n which can be 

made arbitrarily small by taking n large. 

Since each of the r points on the right hand side of (26) be- 
longs to B we have d[jLt(n),B] < diameter of Tj n T^ (Z) < 

6 a n . Hence the extreme points of C n get arbitrarily close to 
B. Since the extreme points of C are not necessarily those of 
any C n and are not necessarily finite in number, we introduce 
the sets E n = { z |d(z,B) 6 Qf n , with z e s}. Since C n is con- 
vex the above argument shows E n 2 C n and we already have 
shown that C n 2 B. Since d o? n - the sets E n are monotone 
decreasing and we have 



(27) lira E n = p) E n = lira E n = C = B 

n oo j n * oo 

which is the first part of the theorem. When < 04 < 1 we 
have B O = BI = - = B = B and the second part of the theorem 
follows. 

The weak form of the trapping theorem was concerned with 
the sets C and B. Correspondingly, the strong form of the 
trapping theorem is concerned with the sets R and S. 

Theorem 3. If a i < I for i = 1, ., t, then S = R, where R 
is the closure of R. Moreover, if at least two of the \i are not 
equal, and if 04 4 for all i, then S is a perfect set.4 

Proof. From the definitions we have S n 2 R n and, since S n 
is a decreasing sequence Sn 2 R m for n <; m. Hence 
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(28) S = H Sn 2 p) R m = R ' 

n=l n=l m=n 

Since S is closed we also have S 2 R. 

Suppose x S, then x S n for every n. Then for each n 
there exists a set of integers i n , -, ij such that 
x e TI TI (2). By (26) and (21) we see that Tj -T 



contains points of R, namely the Tj '^..(Ai). Hence 

d(x,R) < 6a n for every n, i.e., x e R. This proves S = R. 

If a i ^ for all i and at least two of the \j are not equal, 
we must show that S is a perfect set, i.e., we must show that if 
x is a point of S then it is a limit point of S. Let U be an open 
spherical neighborhood with center at x and diameter d. To 
show that x is a limit point of S it is clearly sufficient to show 
that every such neighborhood contains points of S other than x. 
As before, let a = max |o?i I and let 6 be the diameter of S. 

Choose an N so large that 6 a n < d/2 for n > N. Now 

x e SCS n for every n, and Sn consists of the union of sets of 

the form TI Tj (S). Choose n > N; then for some 

*n> * * > *1 we h ave x e TI Ti-(2)C U, since the diameter 
of T in - Ti 1 (E) is at most 6 a n < d/2. Now T in T il (E) 

contains points of S, namely the images of the points Xj,- - , \ 
under these operators. Since at least two of these points are 
distinct and since the lemma implies that the operators are one 
to one, the neighborhood U contains at least two points of S. 
Hence the set U - { x } contains at least one point of S and there- 
fore x is a limit point of S. 

We return now to the double-subscript indexing of the op- 
erators and drop any assumptions on the a f s. Recall that, if 
on the nth trial, alternative j is chosen and outcome k occurs, 
then the operator 

(29) Tjk = <*jkI+(l-<*jk)Ajk 

is applied to the probability vector p(n). If a jk = 1 then Tjk 
is the identity operator; hence if < TTjk then there is positive 
probability that the extreme point MJ = (0, , 0,1,0, , 0) 
(the 1 occurs at the jth place) will not be moved on that trial. 
For the operators Tjk with a jk < 1, or a?jk = 1 and TTjk = 0, 
Theorem 2 still applies. We can restate Theorem 2 as follows: 
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Theorem 4. Consider the following sets. 

(a) Let W be the convex set spanned by the vectors Mj for 
which there is an operator Tjk with a jk = 1 and < m^. 

(b) Let B be the set defined as in equations (21), (23), and 
(24) where the operators Tjk considered are those for which 
o?jk < 1, or for which a jk = 1 and ?rjk = 0. 

(c) Let C be defined as in equations (16), (18), and (20). 

Then C = BUW. When a t < 1 for all i then C = B O UW. 

Verbally this theorem states that asymptotically the proba- 
bility vector will be "trapped" either in the convex subspace W 
or in the convex subspace B. Howevejr, with probability 1 the 
probability vector will be trapped in B. 

By similar reasoning one can restate Theorem 3 as follows: 

Theorem 5. Consider the following sets. 

(a) Let W be defined as in Theorem 4. 

(b) Let R be defined as in equations (21), (22), and (25) for 
those operators Tjk such that a jk < 1 or such that tfjk = 1 
and fljk = 0. 

(c) Let S be defined as in (15), (17), and (19) for the same 
set of operators as in (b). 

Then S = RUW. Moreover, if a jk ^ for each j and k, 
and W contains more than one point, then S is a perfect set. 

Again the probability vector will be trapped in R with 
probability 1. 

Negative values of a have been of little interest in experi- 
mental applications. If the a's are non-negative then Theorem 
3 can be modified as follows: If the a f s are non- negative and 

a jk + <* j'k f < * for a11 J> k > J f f and k? > and if the A J k span an 
n-dimensional convex set, then S is a (generalized) n-dimen- 

sional Cantor set. 

The strength of the above theorems depends upon the size of 
the parameters a jk and the location of the fixed points of the 
operators. If the a f s are small and the fixed points lie in the 
interior of the simplex S then the theorems are very powerful, 
but if the a's are close to unity and the fixed points are at the 
extreme points of the simplex then the theorems (especially 
Theorem 2) may very well be vacuous. It is also clear that 
Theorem 2 could be obtained as a consequence of Theorem 3 but 
that the proof is no easier than the independent proof given. 
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5, THE EQUAL ALPHA CASE 

In Section 3 we restricted the operators Tj k to the form 
given by equation (14). We now introduce another major re- 
striction on these operators as a special case. We require that 

(30) a jk = a, j = 1, 2, r; k = 1, 2, s . 

This restriction greatly simplifies further mathematical con- 
siderations. Furthermore, the resulting model is of interest in 
some psychological applications. 

We will now discuss some of the sequential aspects of the 
problem. We begin with an initial vector p(0). On trial we 
will apply one of the rs operators 

(31) T jk = a I + (l-a)A jk 

to p(0) to obtain a vector p(l). On trial 1 we will again apply 
one of these operators, etc. On trial n, then, there will be (rs) n 
possible (not necessarily distinct) vectors p(n). We shall label 
these possible vectors with a subscript i = 1,2, -, (rs) n . Cor- 
responding to each such vector pi(n) will be a sequence of vec- 
tors for trials to n. Let the probability of occurrence of the 
ith sequence be Pj n . Therefore, on any finite trial n, the 
pi(n) form a multivariate distribution in the original r-dimen- 
sional space and Pi n is the density associated with pi(n). The 
jth component of these vectors pi(n) will have a marginal dis- 
tribution, and this marginal distribution will have a mean value, 
called the jth marginal mean. The r marginal means will form 
a vector V n . Because of the linear- combination properties of 
vectors, V n is the same as the mean vector defined by 

(rs) n 

(32) V n = P in 



With this definition and the operators Tj k we can develop a re- 
cursive formula for these vectors of marginal means. On trial 
n, one of rs events Ej k occurs. The probability that Ej k oc- 
curs is pij(n) 7Tj k where Pij(n) is the jth component of Pi(n), 
i.e., is the probability of occurrence of alternative Aj on trial 
n when the ith sequence has occurred, and where ?rj k is the 
conditional probability of outcome O k , given Aj. The vector 
of marginal means on trial n+1 is then 

(rs) n / r s \ 

(33) V n +! = P in <Z E Py(n) 7r jk T jk Pi (n) > . 
i=l (j=l k=l ) 
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Now when we apply Tj k of equation (31) to Pi(n) we have 

(34) Tj k pi(n) = a pj(n) + (1-a) X jk . 

Using this result in equation (33) and recalling that 7Tj k = 1 
and pj j(n) = 1, we have 

(rs) n / r s , 

(35) V n+1 = P in a Pi (n) + (1-a) p y (n) 7r jk X jk . 

i=l ( j=l k=l 

We then define an average vector Xj by 

_ s 

(36) Xj = 7Tj k X jk 

k=l 

and consider the sum ]C Pij(n) Xj. If we form an r x r matrix 

A by letting its jth column be Xj, we have 

r _ 

(37) pij(n) Xj = Api(n) . 

Thus, equation (35) becomes 

(rs) n , 

(38) V n+1 = P in {a Pi (n) + (1-a ) A Pi (n) J 

but from definition (32) we get 

(39) V n +! = a V n -h(l-a) A V n . 



Hence, we_can obtain the vector V n ^i by applying to V n an 
operator T defined by 

(40) f = a 1+ (1-a) A. 

This expected or average operator T will have a unique fixed 
point x provided that the equation Tx = x has a unique solution. 
But we see that this requires that 

Tx = a x 4- (1-a) A x = x , 
or 

(41) Ax = x. 

Thus, T will have a unique fixed point if and only_if A has a 
unique fixed point. Moreover, the fixed point of T, if it exists, 
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is the fixed point of A . The requirement of a unique fixed point 
is identical to the requirement that the vector equation 

(42) ( A - I) x = 

have a unique non-trivial solution x, and this in turn requires 
that the matrix (A - I) have rank r-1. Moreover,if T has a 
unique fixed point z and if x is any point in 2 r -i then the point 
x(n) = T n x will converge (in the topology of S r- i) to z. 

For some purposes it is convenient to write out the compo- 
nents of the vector of marginal means. Let us denote the ^th 
component of the fixed point by V^^ and the Vth component of 

by A^. Then equation (41) gives 



j=l 

or using the definition (36) 

r s 

(44) V V TT-, 

v"/ L^ L~i "I] 

j=l k=l 
These equations, along with the condition that 

r 

(45) V =1 

.7* >Jx 
2^=1 

may be solved for the asymptotic marginal means, V^ ^. 

We now place one further restriction on the Ajk ^ which 
seems reasonable in experimental applications. We assume 
that if all alternatives are treated alike, i.e., if 

j = 1,2,- , r 

(46) 7Tj k = TTfe 

k= 1,2, , s 
then the asymptotic marginal means are all equal, i.e., 

(47) V 00>j = i J = l,2,.-. f r 

This restriction simply means that if all alternatives have the 
same set of outcome probabilities, then, on the average, no al- 
ternative will be preferred to any other in the limit. We further 
assume that the Xjk do not depend upon the njk, i.e., the limit 
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point of any operator Tjk is independent of the conditional 
probabilities of applying that operator. When we introduce 
equations (46) and (47) in (44) we have 



(48) 



r s 

E E 

=l k=l 



We want this equation to hold for all values of the 7r k consistent 
with the condition that 



(49) 



and so we must have 



k=l 



7T k = 1 



(50) 



This condition is useful in applications where the alternatives 
are symmetrically defined, e.g., when the experimental design 
is such as to eliminate effects of position preferences. 



6. THE EQUAL ALPHA CASE WITH TWO ALTERNATIVES 
AND TWO OUTCOMES 

In this section we will apply the analysis of the preceding 
section to the case of two alternatives, AI and A 2 , and two 
outcomes Oj and 03, to illustrate how the mathematical system 
might be used to describe a simple experiment. We shall use 
condition (50) and so we have 

k= 1,2 



'=1,2 
Further, since Aj^ is a probability vector 

j =1,2 
(52) Ajk j + \*^ 2 = 1 

k= 1,2 
We now make a further restriction on the Xii, . We assume that 
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outcome Oj is associated with "reward" and 03 with "non- 
reward," [so that if *u - 1 and 1121 = (Aj is always "re- 
warded" and A 2 is never "rewarded") then V^j = l]. Equation 
(44) gives for these special conditions, 

*11,1 = 1 

(53) 

Ml,2 = 0- 
Equations (51) then give 



(54) 

*21,2 = 1 

These equations along with equations (44) and (45) then allow 
us to solve for y^,! an( * obtain 

1 - TToi 

(55) V^i = - - ^ - . 

2 - nn - "21 

This result was obtained earlier by Estes*in unpublished work 
The interpretation of equation (55) for two choice situations 
with risk is clear; it gives the asymptotic mean proportion of 
choices of alternative Aj in terms of the probability TT^ of 
reward of Aj and the probability ^21 of reward of A2. For 
example, if TTH = 0.7 and 7T21 = 0.2, then V^i = 0.73. Thus, 
the equal alpha case predicts that the organism will not learn 
to choose the favorable side all the time. Furthermore, there 
is evidence in some experimental situations that indicates that 
this result is approximately correct. 



7. APPLICATIONS OF THE SYSTEM 

In applying the mathematical system described above to the 
analysis of experimental data, a number of problems arise. 
First of all, one must set up identifications between elements of 
the mathematical system and observables in an experiment; the 
alternatives and outcomes in the model must be given empirical 
referents. Any rules for making such identifications are outside 
the mathematical system but are crucial in making use of it. 
Secondly, there are serious problems in statistical estimation 
of parameters, such as the a jfc and the components of the Xjk, 
from experimental data* Most standard statistical techniques 
are not easily applied to such a stochastic process. 
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Most of the applications of the system have been to problems 
with but two alternatives. Bush and Mosteller [3] describe the 
use of two special cases when r = 2, one to data on the avoid- 
ance training of dogs and the other to data from two-choice ex- 
periments on rats and humans. Estes has applied a mathemati- 
cal model which is essentially equivalent to the one described 
in this paper to a number of experiments but most of his work 
is still unpublished. Miller and McGill [6] present a model, 
which they show is a special case of this system, and apply it to 
data on rote learning. Finally, Flood has used the system for 
studying the behavior of people in a nine- choice punch- board 
experiment [5]. In these several studies, the estimation pro- 
cedures have involved either fitting the data to the marginal 
means in the model or using standard maximum likelihood pro- 
cedures. None of these procedures utilize all the information 
contained in the data and hence there is a need for better es- 
timation techniques. 



APPENDIX 

Since the manuscript of this paper was written a more gen- 
eral development of the combining of classes condition has been 
found which we present here. The original manuscript was 
shown to L. J. Savage who then proved the following theorem: 
To satisfy the combining of classes condition in three or more 
dimensions, a continuous stochastic operator S must be of the 
form Sp = ap + (l-a)X, where a is a suitably chosen scalar 
and X is a probability vector. In this appendix we prove a 
slightly more general theorem without using the assumption of 
continuity, and so extend the theorem to arbitrary stochastic 
operators. 

In the following, we will always imply that probability vec- 
tors and operators are r-dimensional, where r is finite and 
r > 3. 

Definition 1. By a stochastic operator S we shall mean an 
operator which sends probability vectors into probability vec- 
tors, i.e., Sp = p f where p and p f are probability vectors. 

Definition 2. A projection of a probability vector p = 
(Pl> P2 ' * > Pr) into the probability vector p f is obtained by 
choosing a non-empty sub-set a of elements of p and replacing 
all but one of them by zeros and that one by the sum of the 
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elements in the sub- set, while maintaining unchanged the ele- 
ments of p not in a. Any stochastic operator C that projects 
probability vectors is a projection operator and can be repre- 
sented by an rxr matrix. 

Clearly there are a finite number of projection operators 
corresponding to the finite number of sets a. 

Definition 3. A stochastic operator S is said to satisfy the 
combining of classes condition if and only if 

(1) CSCp = CSp 

for all projection operators C and all probability vectors p. 

Lemma 1. If S satisfies the combining of classes condition 
and p and p f are probability vectors such that Cp = Cp f , then, 

(2) CSp = CSp' . 

In particular we have, for the components Sjp of Sp 

(3) S jP - 5jP ! 

J(T j<7 

where a is the set defining C. 

Proof. If Cp = Cp f then CSCp = CSCp' and application of (1) 
to both sides shows that CSp = CSp 1 , which is (2). Expression 
(3) is obtained by reading off the proper component of the vec- 
tors in (2). 

Lemma 1 says that if the projections of two r-dimensional 
vectors on a sub- simplex are equal, then the projections of their 
images on that sub- simplex are equal. 

Lemma 2. If S satisfies the combining of classes condition 
then Sp = (yi(pi), y2(P2)> * ' % Vr(Pr)) where the yj are func- 
tions sending I into I (I is the unit interval, {x|0 < x < l} ). 

Proof, Let a = 1, , i-1, i+1, , r and let C be the 
projection operator defined for this cr. Let p be a fixed proba- 
bility vector; then for all probability vectors p 1 with pj 1 = pj 
we have .Z PJ = Z pj' = 1 - pi so that Cp = Cp f . Lemma 1 

J or J(T 

now implies that (3) holds, so that 

SIP = 1 - S jP = 1 - S jP ' = S iP ' . 
jeer jea 
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Since p f was an arbitrary vector satisfying pi 1 = PJ, and p 
fixed, this shows that SIP depends only on PI, and defines a 
single -valued function yi(pi) = Sjp. This function yt maps I 
into I because S is a stochastic operator so pi < 1 im- 
plies < yi(pi) < 1. 

Lemma 3, Let x be any point in I; then there exists a 
function y sending I into I such that 

(4) y t (x) = y(x)+yi(0) 

for i = 1, , r. Moreover, if x, y and x + y belong to I, 
then 

(5) y(x + y) = y(x) + y(y) . 

Proof, Consider two probability vectors p and p f such that 
P = (PI, P2> P3> ' i Pr) and p' = (PI + P2, 0, ps, , p r ). 
Then r 

(6) i-L yj(pj) = n(Pi) + 72(P2) 

i=3 



P2) 

Setting PI = and P2 = x in this equation we have 
yi(x)-yi(0) = y2U)-y2(0) = y(x) 

thus defining the function y sending I into I. By repeating the 
argument for other pairs of indices we obtain (4). 

Now let x, y and x + y belong to I; setting x = pj, y = p2, 
substituting (4) into (6) and simplifying, we obtain (5). This 
completes the proof of Lemma 3. 

We want to show that y is a linear operator, i.e., that 
equation (5) above is satisfied and in addition 

y(cx) = c y(x) 

for all x in I and all real constants c such that ex is in I. 
We will prove that this latter requirement is met by showing 
that y(x) = ]3x where j3 is a constant in I. 

Lemma 4. If y is a function sending I into I and y ( x + y) 
y ( x) + y ( y) for x, y and x + y in I, then y(x) = /3x for some 
non-negative constant 0. 

Proof. 5 Let y(l) = ft > 0. Then j3 = y(| +1) = 2y(i) so 

- 1 B \ R 

that y(~) = By induction y(-) = , Now for n > 2, 
& n n 



124 DECISION PROCESSES 

y(l + i) = 2 y(I) = 2 and, by induction, y( ) = -^ if m < n. 
' v n n' ' v n' n ' J ' ' v n ' n 

Suppose now that the assertion is false, namely, suppose that 
there exists an x such that y (x) = j3x + a, where a is a posi- 
tive real number. (For a negative an analogous argument holds.) 
Observe that y (x) = y ( (x - y) + y) = y ( x - y) + y ( y) so that 
y(x - y) = y(x) - y(y), if x > y. Hence, for all rational p such 
that x p belongs to I, we have 

y (x p) = y (x) y (p) = 0(x p) + a . 

Since a > there exists a positive integer N such that Na > 1. 
In the open interval < z < 1, choose any point y = x + p, with 

p a rational number. Such a point exists since the points of the 
form x + p are dense in I. The point Ny belongs to I since 

Ny < NJJ = 1. By induction one can show that y(Ny) = Ny (y). 

Moreover 

y(Ny) = Ny(y) = N[j3y + a] Na > 1 

contradicting the fact that y sends I into I. This contradiction 
establishes the lemma. 

Theorem 1. A stochastic operator S (for r 3) satisfies 
the combining of classes condition if and only if it has the form 

(7) Sp a ap + (l-a)X 

where X is a probability vector and a is a suitably chosen 
constant. 

Proof. The "if 1 part of the theorem follows easily by showing 
that an operator of the form (7) satisfies the combining of 
classes condition. 

For the "only if" part, observe that Lemmas 3 and 4 show 
that 

(8) yi(x) = /3x 



Define a constant a by means of the equation 

r 
(9) yi (0) = 1-a . 



We now have two cases to consider, namely a ^ 1 and a = 1. 
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yj(0) 
Casel. a ^ 1. In this case we set \i = ^ and note that 

= 1. 



Then we have from (8) that yj(x) = /3 x + (l-a)X^ . Let p be 
a probability vector; then 

1 = S 



so that ]3 = a . Thus yi(pi) = crpj + (l-a)X^ and the operator 
must be of the desired form (7). 

Case 2. a = 1. Equation (9) then implies that Zyj(O) = 
and, since each yi(0) is non-negative, this means that yi(0) = 
for all i. Let p be a probability vector; we have 



1 = Eyi(pi) = E/3 Pi = jSZpi = /3 

so that j3 = a = 1. Then y^pj) = pj, i.e., Sp = p, and hence S 
is the identity operator which can be written in the form (7) by 
setting a = 1. 

This completes the proof of the theorem. 

For the case r = 2, equation (1) is satisfied for all operators 
and imposes no restriction. Hence Theorem 1 cannot be proved 
if n = 2. Analogous situations occur in other parts of mathe- 
matics in which the lower dimensional cases are atypical. How- 
ever, it is frequently convenient to assume the operators have 
the form of (7) also when r = 2. 



FOOTNOTES 

1. In the appendix we generalize the combining of classes re- 
quirement to apply to all stochastic operators and then demon- 
strate that Theorem 1 below is still valid. 

2. For typographical convenience we write all vectors as row 
vectors, but it is clear from the context when a column vector 
is meant. 

3. This example was provided by H. Raiffa. 

4. A perfect set is a closed set which has no isolated points, 
that is, one in which every point is a limit point. 

5. The essential idea of this proof was suggested to us by R. L. 
Davis. 
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CHAPTER DC 

INDIVIDUAL BEHAVIOR IN UNCERTAIN 

SITUATIONS: AN INTERPRETATION IN 

TERMS OF STATISTICAL ASSOCIATION THEORY 

by 

W. K. Estes 
INDIANA UNIVERSITY 

INTRODUCTION 

Group decision processes depend upon the behavior of indi- 
viduals,, For this reason it is to be expected that theories de- 
veloped in social sciences and theories developed in experimen- 
tal psychology will not be unrelated. Inspection of certain of the 
more formalized theories of group behavior, e.g., theories of 
games and economic behavior, reveals that these theories in- 
clude, explicitly or implicitly, assumptions concerning charac- 
teristics of individual behavior. It is an attractive possibility 
that the descriptive laws or principles of behavior that enter 
theories of group behavior as axioms may be deducible from 
theories of individual behavior. 

We note that in a game or an economic situation the indi- 
vidual is called upon to predict or to attempt to control uncer- 
tain events. The sequence of events may be random or it may 
follow a pattern about which the individual has initially incom- 
plete information. We shall discuss in this paper two sets of 
experiments that have been carried out with adult human sub- 
jects (college students) in highly simplified experimental situa- 
tions. The first set will bring out certain aspects of the behavior 
of an individual in attempting to predict correctly the outcomes 
of a series of situations when the alternative outcomes occur in 
a random sequence (the individual not being informed that the 
sequence will be random). The second set of experiments will 
bring out certain aspects of the behavior of an individual in at- 
tempting to produce an event by choosing on each trial some one 
of a set of alternative responses, the different responses having 
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128 DECISION PROCESSES 

different probabilities (initially unknown to the individual) of 
producing the event. In each case we will show how the data 
can be handled in terms of a statistical theory of associative 
learning. 

It is contemplated that these simplified experiments may 
provide basic paradigms which can be progressively modified 
in the direction of situations directly relevant to utility theory 
and game theory. It will also be of interest to consider possible 
relationships between the model that accounts for the data of 
these experiments and models that are developed in connection 
with other decision processes,. 



CASE I: PREDICTION OF AN UNCERTAIN EVENT 

The experiments to be considered in this section all have the 
same basic design,, The activity studied is similar in some re- 
spects to that involved in predicting the results of roulette games 
or horse races; there are two important simplifications, how- 
ever. In these experiments the outcome of the situation has no 
utility for the individual (except that of being right or wrong in 
his guess) and the information available to the individual is re- 
stricted to what he can obtain from observing a series of repli- 
cations of the situation. 

In any one experiment, the subject is run for a series of 
trials. All trials in the series begin with a signal, S. In some 
experiments S has been onset of a light, in some the onset of a 
tone, in some simply a verbal signal from the experimenter. A 
short time (2-5 seconds) following the signal S, one of a set of 
alternative outcomes, Ej, 2, i E n , occurs and terminates 
the trial. Immediately after the onset of S on each trial the 
subject writes down his prediction as to which of the events 

E l> E 2> ' ' > E n wil1 occur on that trial; then he is permitted 
to observe which event actually does occur. The subject is 
given no information about the conditions of the experiment ex- 
cept that some one of the Ej will follow the signal S on each 
trial; he is instructed to do his best to make a good score (in 
terms of correct predictions), and to make a prediction on each 
trial regardless of how uncertain he may feel about the outcome. 
The events Ej, E 2 , > E n actually occur at random with 
probabilities irj, T^, 1 7r n . No communication between sub- 
ject and experimenter is permitted once the series of trials has 
begun. Trials are spaced at intervals of about five seconds or- 
dinarily. 
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The theory that has been applied to this situation with some 
success is a statistical model for associative learning that has 
been developed during the last five years by the writer and 
others. In this theory the behaviors available to the subject on 
any experimental trial are categorized into mutually exclusive 
and exhaustive classes (in the present experiment alternative 
predictions concerning the Ej) by means of experimental cri- 
teria It is assumed that the change in probability of any re- 
sponse class on a given trial depends upon the momentary en- 
vironmental situation and upon the state of the individual as 
defined in the model. When the model is interpreted in terms 
of the present experiment, it turns out that the rate of learning 
(systematic change in probability of making a given prediction) 
depends upon the characteristics of the momentary environmen- 
tal situation but that over a considerable series of trials the 
probability of making a given prediction tends to a stable asymp- 
totic distribution with the asymptotic mean, for a group of simi- 
lar individuals run under like conditions, being independent of 
the momentary environmental situation, in the present experi- 
ment, the nature of the signal, S. The dependence of PJ, the 
mean probability (for a group of like individuals) of predicting 
event Ej, upon n, the number of previous trials in the series, 
is given by the equation 

(1) p, (n) = TTJ - -L $ i [,, - F lf j (0)] [1 - t P 



where the 0{ represent the probabilities of occurrence on any 
trial of the N environmental determiners comprising the situa- 
tion at the beginning of the trial, TTJ represents the probability 
of the outcome Ej on any trial, and FI j (0) represents the ini- 
tial probability that the ith element in the set of environmental 
determiners would, taken alone, lead to a prediction of outcome 
Ej. This equation is derived from certain primitive assumptions 
concerning the learning process. The assumptions are given in 
references [3, 4]. 

Since the stable course of action arrived at by the individual 
over a series of trials is the feature of the situation that is apt 
to be of most interest in relation to group decision processes, 
we shall consider only the asymptotic distribution of pj in this 
paper. It can be seen from inspection of equation (1) that the 
asymptotic value of PJ is independent of the initial state and also 
of the distribution of 0j. Therefore we can conveniently simplify 
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the model for our present purposes by assuming all of the 0^ 
equal to some value 0, which amounts empirically to assuming 
that all aspects of the environmental situation obtaining at the 
beginning of each trial are equally likely to affect the subject. 
Then the difference equations leading to equation (1) in the the- 
ory reduce to the following set of linear transformations: 

(2) (a) If Ej occurs on trial n, then 

Pj(n + l) = pj(n) + 0(l-pj(n)) 

(b) If EJ fails to occur on trial n, then 

Pj(n + l) = pj(n) - 0pj(n) 

(c) If Ej occurs with probability TTJ, then on the average 

Pj (n+ 1) = pj (n) + 0(?Tj - PJ (n) ) 
and we have asymptotically, 

(3) (a)pj() = TTj 

and (b) a 2 (*) = ^(1-^)07(2-0) 

Pj j j 

It will be noted that the equations (2) and (3) can be ob- 
tained from the linear operator model of Bush and Mosteller [l] 
if suitable restrictions are imposed upon the parameters. 

Data from two experiments of the sort under consideration 
are summarized in Figure 1 below. 

The bottom curve represents data collected by James H. 
Straughan and the writer at Indiana University. Subjects were 
30 college students The signal, S, was a pattern of four lights 
which flashed for one second at the beginning of each trial. The 
outcomes Ej and 2 were the appearance or non-appearance 
of a single light two seconds after the signal pattern. The trials 
on which Ej appeared were determined in advance of the ex- 
periment by a random number table with TTJ = .25 and ^2 = -VS, 
Each point on the curve represents the mean relative frequency 
of predictions of Ej in a block of 10 trials. The three upper 
curves represent data from a very similar experiment conducted 
by Jarvik [7] at the University of California, with TTJ equal to 
.60, .67, and .75 for three groups of subjects respectively. In 
Jarvik 1 s experiment the signal at the start of each trial was ut- 
terance of the word "now" by the experimenter and the two al- 
ternative outcomes, Ej and 3 were utterance of the word 
"check" or the word "plus," respectively, by the experimenter. 
The Jarvik data are plotted in terms of mean relative frequencies 
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7T l = .75 




Blocks of Trials 



Fig. 1. Data from prediction experiments plotted in terms 
of mean proportion of Ej predictions per trial block. 



of "check" predictions over blocks of 11 trials. It will be seen 
that in all cases the mean value of pj levels off in the neigh- 
borhood of the predicted asymptote by the end of 60 to 80 trials. 
The agreement of theory and experiment in this instance is es- 
pecially interesting in view of the fact that the theoretical pre- 
dictions of pj (>) utilize no degrees of freedom from the data. 
Similar experiments by Humphreys [6], who originated the ex- 
perimental design, and Grant [5] report data which are in ac- 
cord with theoretical predictions insofar as asymptotic means 
are concerned. The Jarvik, Humphreys, and Grant papers do 
not report individual data or variance estimates. 

The reader may be concerned at this point over the fact that 
the experimental curves cited represent group performance. 
The terminal mean probabilities of Ej predictions by groups of 
subjects agree with the theory in all instances, but according to 
the theory, not only group means but also mean response proba- 
bilities of individual subjects should approach v\ asymptotically. 



132 



DECISION PROCESSES 



We cannot tell from the group data whether this feature of the 
theory is verified. A mean response probability, pj (n), equal 
to TTJ could arise if the values of pj (n) tended to cluster around 
irj, but it could also arise if the proportion TTJ of the subjects in 
a group ended up at pj (n) = 1 while the remainder went to 
PI (n) = 0. As a matter of fact, inspection of individual data from 
a number of experiments run in the writer's laboratory shows 
that the latter possibility can be rejected. To illustrate this 
point, we have plotted in Figure 2 the records of each of a group 
of four subjects run in one of these experiments under somewhat 
better controlled conditions than the group studies. The values 
plotted are proportions of E^ predictions per ten- trial block. 
According to theory, these individual curves, regardless of ini- 
tial value, should tend to approach TT^ = .85, the relative fre- 
quency of EI, as learning progresses. Three of the four 
records certainly do this; the fourth record is in some doubt, 
not having stabilized anywhere by the end of 120 trials. 



1.0 



P(n') 0.5- 



0.0 



1.0 



P(n') 0.5 



0.0 



12 



6 
n 1 



12 



Fig. 2. Individual records from a prediction experiment 
plotted in terms of proportion of EJ predictions, P(n'), 
per block of ten trials, n 1 . 
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It may be noted that during the later trial blocks, the plotted 
values for these subjects tend to fluctuate around the ,85 level 
rather than remaining constant. This asymptotic variability is 
also in accordance with theory. Since the theoretical asymptotic 
probability of Ej predictions is .85, the standard deviation of 
proportions of Ej predictions per block of ten trials at the 
asymptote will be expected to be approximately (neglecting a 
covariance term) 



.85 X .15 
10 

In the case of the experiment represented in the lowest curve 
of Figure 1, the inter-subject variability around the mean curve 
decreases steadily throughout the series of trials. In the last 
two blocks of trials over 60 per cent of the subjects have pj 
values (i.e., proportions of Ej predictions) in the range .1 to 
.3 as compared with 33 per cent in the first block, and the pro- 
portions of zero and one values do not increase at all during the 
series. 

It will be noted that the course of action adopted by the sub- 
jects in these experiments does not maximize the expected fre- 
quency of correct predictions. Take, for example, a series with 
TTJ equal to .25. The subjects settle down to a relatively steady 
level at which they predict Ej on 25 per cent of the trials. This 
behavior secures the subject an expected percentage of correct 
predictions 

(25 x .25 + 75 x .75)% = 62.5% 

whereas the "pure strategy," in the von Neumann and Morgen- 
stern sense, of predicting the more frequently occurring event 
on all trials would yield an expectation of 75 per cent correct 
predictions. 

It would be of interest in connection with the broader problem 
of decision processes to modify the design of these experiments 
on prediction by introducing systematic variation in the informa- 
tion available to the subject and in the utility attached to the out- 
comes Ej, 2, y E n . 



CASE II: ATTEMPTING TO CONTROL AN UNCERTAIN EVENT 

A group of experiments which are closely related, in terms 
of the theory, to those described above, study the behavior of the 
individual in attempting to control an uncertain event. In one 
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variation of this experiment, two telegraph keys are available to 
the subject and he is required to choose one or the other on each 
trial. On a panel in front of the subject there is mounted a lamp 
globe. The subject is told that on each trial one key is "correct" 
and one key is "incorrect" in accordance with some scheme or 
plan that is entirely unknown to him except for what he can learn 
during a series of trials, and is led to believe that whenever he 
chooses "correctly," the light on the panel will flash. Actually 
the light is operated by a device programmed by means of a 
random number table so that the light flash will follow choices 
of key #1 with probability n^ and choices of key #2 with proba- 
bility 7T 2 . 

As in Case I, a theoretical account of this experiment can be 
derived from the statistical model [3,4]. For purposes of the 
present discussion we will give here simply the end result of 
the derivations. If we again simplify the model by assuming 
that all components of the situation S have equal probabilities 
of influencing the individual's response, then the changes in re- 
sponse probability on any trial are described by the linear 
transformations given in the table below. Let pj(n) represent 
the probability of choosing key #1 on trial n. 

Key Chosen Outcome Expected Change in PI 

1 Ej Pl (n 4- 1) = p x (n) + 0(1 - Pi(n)) 

1 E 2 Pjfa+1) = Pi(n) - 0pi(n) 

2 Ej p x (n + 1) = pj(n) - flp^n) 
2 E 2 Pi(n + 1) = pj(n) + 0(1 - 



Then under the conditions stated above, we obtain for the ex- 
pected change in mean probability of choosing key #1 



(4) pj (n + 1) = 9(1 - 7r 2 ) + (1 - 29 + 07TJ + 0rr 2 ) p"i(n) 
And for the asymptotic mean 

(5) pi (oo) = 2-^*2 

The data reproduced in Figure 3 are taken from an experi- 
ment conducted at Indiana University by Marvin H. Detambel [2]. 
Detambel ran four groups of college students under the condi- 
tions described above with the following values of TTI and 7r 2 
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i.oo 




-7r = 1.00 



* = -67 



TTj = .50 



Blocks of Trials 



Fig. 3. Mean proportion of Ej predictions per four- 
trial block in Detambel's two-key, contingent reinforce- 
ment experiment. 
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Theoretical Asymptote, pj(<) 
U) 
0.5 
0.67 



Predicted values for asymptotic mean probabilities of choos- 
ing key #1 are readily computed from equation (5) and the 
values for Detambel's groups are included in the table just 
above. In the case of Group IV, which has TTJ = 0.5 and 1*2 = 0, 
we see that the course of action to which the subjects tend over 
a series of trials is far from the " pure strategy" which would 
be the best solution to the situation from the standpoint of maxi- 
mizing successes. Over the last half of the series the subjects, 
on the average, select key #1 approximately 67 per cent of the 
time, in accordance with theoretical expectation. Under the con- 
ditions of the experiment, this course of action yields as the ex- 
pected percentage of successes 
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67 x. 50 + 33x0 = 33.5% 

whereas by selecting key #1 on all trials the subjects would 
have raised their expectation to 50 per cent successes. 

The behavior pattern exhibited by these subjects may be 
better suited, in some respects, to dealing with environmental 
uncertainties than the strategy of going over to a p value of 
unity on the more frequently reinforced response. The "pure 
strategy" would be optimal only if the sequence of environmental 
events constituted a stationary time series. Actually in this ex- 
periment the subjects were not told that the TT values would re- 
main constant throughout the series and they had no sound basis 
for inferring constancy. The compromise solution of the prob- 
lem arrived at by the subjects would be advantageous if the en- 
vironmental probabilities were to change at any point in the 
series so that the formerly unfavorable response became the 
more favorable. If, for example, the probability of reinforce- 
ment on key #2 in the Detambel experiment had been changed 
from zero to .75 halfway through the series, the subjects would 
soon have shifted their response probabilities to a level appro- 
priate to the new situation, whereas a "rational" subject operat- 
ing under a "pure strategy" of selecting key #1 on all trials 
would not have discovered that the balance of probabilities had 
changed. It is very doubtful, however, that many individuals in 
these experiments work out a solution at a verbal level. Proba- 
bly long experience with environmental uncertainties has de- 
veloped relatively stable habits of response to reinforcement 
and non- reinforcement. In this situation it appears that the 
subject proceeds as though failure of reinforcement on one key 
means that the other response would have been correct on the 
given trial, although as the experiment is conducted this actually 
is not generally true. Subjects might eventually change their be- 
havior pattern, but under experimental conditions they give little 
evidence of doing so even over series of considerable length. 

Both types of experiment seem to support the conclusion that 
in a simple decision process the human subject tends to behave 
in accordance with the principles of associative learning and not, 
in general, in the most rational manner as "rational" is conven- 
tionally defined. The suggestion arises, then, that in formulating 
theories of group decision processes it may be worth while to 
draw upon the principles of individual behavior revealed by ex- 
perimental-theoretical research of the kind described here 
rather than to depend upon common sense notions concerning 
characteristics of individual behavior. 
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CHAPTER X 

ON GAME-LEARNING THEORY 
AND SOME DECISION-MAKING EXPERIMENTS 

by 

Merrill M. Flood 
COLUMBIA UNIVERSITY* 

1. INTRODUCTION 

The theory of games [14] provides a general mathematical 
model that may sometimes be used to approximate a real situa- 
tion. Most often, in real cases, the situation is much too compli- 
cated to permit formulation even conceptually as a formal game. 
In the few cases that can be so formulated, it is almost always 
impractical to attempt gathering the necessary data or to do the 
elaborate calculations required for a solution. 

The non- constant- sum case, even with two players, remains 
unsolved in the sense of von Neumann- Morgenstern. There are 
theoretical proposals that dispose reasonably well of the two- 
person case, and of many other broad special cases; we have 
discussed some of these in another paper [8]. 

In this paper we investigate game-like situations in which the 
players are limited biologically in their choices of moves. These 
limitations are reflected in the method of play of the formal game 
and stem from the notion that animal organisms seem to learn by 
some sort of conditioning process that alters the probability that 



*This work was done while the author was with The RAND 
Corporation, under Project RAND of the Department of the Air 
Force, and the results were presented in July 1952 at the Santa 
Monica conference on decision-making. The present paper is a 
condensation of an earlier memorandum [9], and was prepared at 
Columbia University while the author was with the Behavioral 
Models Project of the Office of Naval Research. Grateful thanks 
are also due to Dr. D. R. Fulkerson for a careful reading of the 
manuscript, and for many helpful comments during the course of 
the work. 
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some one of several mutually exclusive alternatives will be se- 
lected in each new instance. 

This approach was suggested to us by the work of R. F. Bales 
and A. S. Householder [l], [2] on the group interaction process, 
and is closely connected with the work of R. R. Bush and C. F. 
Mosteller [4] on mathematical models for learning. We have 
profited from discussions with all four of these men. There is 
also an interesting philosophical discussion of stochastic learn- 
ing models in a recent paper by D. M. MacKay [12], and a stimu- 
lating essay by E. G. Boring [3] on M robotology M ; both of these 
latter papers seem to support our methodological viewpoint. 



2. GENERALITIES 

The approach used in this paper is applicable to situations in- 
volving more than two organisms, but we shall concentrate some- 
what on the two-player case. A player could in fact be a group of 
people, or a component of personality within one individual, but 
we only touch upon such interpretations. Since our main object 
is to treat some one case of real behavior, we shall usually be 
content with a discussion in terms of a special real- life situation, 
leaving broader interpretations to the reader. 

The connection with game theory is the correspondence be- 
tween the notion of choice of a strategy for a game in normal 
form and the notion of individual choice of course of action in 
biological activity. A very fundamental case, and perhaps the 
simplest one, is the problem of choosing whether or not to act 
in a situation where there appears to be only one choice: acting 
or not acting. For example, in experiments like those of B. F. 
Skinner [13] with rats, the choice at some moment is whether or 
not to press a bar. For a human example, the choice might be 
whether or not to accept a particular offer for a new position. 
In these examples, and in most real-life situations, the organism 
somehow reduces its range of alternatives to a relatively small 
number from which it feels it must choose. 1 It is this recog- 
nized field of choices, whether they are considered to be con- 
scious or unconscious alternatives, that corresponds to the set 
of strategies listed in the normal form of the formal game. 

In what follows, we shall try to use such terms as "game", 
"strategy", "move ft , and "player" from game theory only with 
the meaning attached formally by von Neumann and Morgenstern 
[14]; for other purposes we shall use alternative words such as 
"situation", "plan", "act", and "subject". 
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3, THE PROBLEM 

The games in which we shall be interested are defined in 
terms of expectation functions: 

v l ~ V lli2 ' ' ' i n for (i k = 1 ' 2 ' ' ' ' ' m k ; j = !' 2 ' ' * ' ' n) ' 

A play of the game consists of simultaneous independent choices 
of specific values ig for the i k by the n players. The quantity 

V^ o is the expectation for player j, where the units for V? o re- 

late to a measure of the utility attached by player j to the pay- 
jnents he receives. The functions V\ are real-valued. 

The actual payment to player j , when the choice of pure 
strategies is i. on a play, is a quantity x given by the distribu- 
tion function P^(x) whose mean value is V} Of course |x| is 
bounded, so that 



PJ(x)=0 if |x| >b| . 



Our problem is to select a good method of play that can be used 
by player l> when his information about the structure of the game 
is knowledge only of 

-J -4 

m* and a bound b > max bj >0 , 

* i 

and where his information about the distribution functions P](X) 
is gained entirely from his experience while playing the game. 

It is customarily assumed [14] in the process of passing from 
the normal to the extended form of a game, that only the mean 
values V] of these distributions affect the situation; we could 
even assume without essential loss of generality, therefore, that 
the variance of Pj(x) is zero so that x only assumes the value 
Vj. Furthermore, since the problem is essentially unchanged if 
the utility measure x is subjected to a linear transformation we 
could also take b^ = 1 and suppose that P^(x) =0 if x<0 or x>l. 

The experience gained by player j in N plays of the game 
consists of a record of his own choices ij(t), and receipts n(t) 
for t = 1, 2, , N. The central problem is to find a rule of 
play that will tend to maximize total receipts in a sequence of 
plays where the player is given some information about the num- 
ber of plays before he starts on a sequence; we are intentionally 
vague at this point about the exact nature of the rule of play and 
about the advance information concerning the length of sequence. 

We shall be interested in what follows, then, only in the game 
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whose normal form has the expectation functions 

< V j < 1 . 
i 

We shall be especially concerned with one extended form of this 
game in which there is one chance move for each player and the 
actual payments are always unity or zero, whence the probability 
of a unit- payment to player j is v| if the players choose the 
pure strategies i. We have noted tnat any game can be reduced 
to this form by suitable linear transformations on the utility 
measures of the individual players, provided only that there are 
known finite bounds on the possible payments; use was also made 
of the assumption that games in extended form are equivalent if 
their normal forms are identical. 



4. THE GAME- LEARNING MODEL 

The type of rule of play, that is investigated here, is repre- 
sented for player j by the relation 

pJ(t + 1) = Ml h pJ(t) for h = 0, 1, 2, , 2mj + 1, 

where pi(t) is an mj + 1 dimensional probability vector and the 
M^ are given square stochastic matrices whose elements are 
non-negative real numbers that sum to unity in each column. 
The components p|(t), for i = 0, 1, 2, , mj, are the proba- 
bilities that player j selects value i for ij on play t; pj^t) has 
a special significance to be discussed later. 

We now describe how the appropriate operator M^ is chosen 
after play t. The matrices M^ h are first separated into two 
classes of mj+1 members each, denoted Rl^ and PJ^ for 
k = 0, 1, 2, , mj, and either RJ ijW or PJ ijW is selected; 
the choice between these two matrices is made with probability 
rj(t) in favor of RJ *jW. In the special case in which rj(t) is al- 
ways or 1, we can write the rule of play in the form: 

p(t + 1) = { P (t) RJ 4 jW + (1 - p(t)) PJ *jW} pJ(t). 



We have defined a method of play that can be used by any 
player after he has made his initial strategic choice pJ(0), and 
after specific values have been assigned for the elements of MJ h ; 
in actual practice he will need to know mj and EJ also. 
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5. THE FUSION GAME- LEARNING MODEL 

We shall now consider a special parametric form for the 
matrices MJ h . For convenience, we shall henceforth omit the 
designation of the player when this leads to no ambiguity. 

We set: 

and 



for (i, a, = 0, 1, 2, , m), where a 1 , b 4 , and c* are in the 
closed interval [0, l] and 600 is one or zero according as a= j3 
or not. This special form of the more general Markov game- 
learning process was developed by Bush and Mosteller [6] so as 
to fit data obtained in a number of learning experiments with rats; 
they have named it the "fusion model." 

We shall be interested in the case in which all but one player, 
say number 1, choose constant strategies pJ, but where player 1 
uses the fusion model. This means that the probability that 
player j > 1 will select the value i for ij is p\ on each play, 
and for player 1 it is p[(t). Since these choices are all made 
independently it follows immediately that the expectation of 
player 1, if he chooses the value x for i^ on play t, is: 

m2 m n 

p . v* V v D* py pP 

U X " LJ LJ V xi9 i ^19^13 In* 



We may suppose, without loss of generality, that: 



It follows that player 1 could do no better than to choose 

P?(t) = 

so that his expectancy on each play is G m .. We shall be inter- 
ested in comparing his expectation when he makes use of the 
fusion model with this maximum possible expectation G mi ; of 

course, his success with the fusion model may also depend upon 
his choice of the starting vector p*(0). 

The component p^t) is interpreted in the game situation as 
the probability that no choice will be made by player j at time t. 
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This feature can be introduced mathematically into the game 
model by defining: 

VJ = if any component of i is zero, 

where the range of i k has been extended to include zero. This 
augmentation of the original game problem will be used whenever 
the fusion game-learning model is under discussion. 



6, A SPECIAL FUSION MODEL 

We shall now consider a specialization of the fusion model in 
which a*, b*, and c* are positive and independent of i; we denote 
their common values a, b, and c. We shall also suppose that the 
quantities G x , for x = l, 2, , m, are distinct and non-zero, 
and that G = 0. 

The expected value of p 4 **, given p*, is: 




a=0 

This can be rewritten to yield the following relation for the k th 
component of the expected value of p*+*: 

m m 

(a-b-c) pG a p{ c + (c-a)G k pt+b p^dM, 
a=0 a=0 

for k = 0, 1, 2, , m. 

It follows easily that a vector V* 1 satisfies the equation QV = V 
if and only if it is the unit vector e o or has the following form: 

= 1, 2, -, m). 

Bush and Mo s teller [5] have called the matrix Q the "expected 
operator' 1 and they have made use of the vectors V* 1 in discussing 
the asymptotic behavior of p 4 . 

Rather little is known concerning this asymptotic behavior, and 
still less is known about methods for estimating the parameters 
in the fusion model from experimental data, so we shall have to 
resort to Monte Carlo computational methods in our exploration 
of the properties of the fusion model. For this purpose, we shall 
turn to some numerical examples. 
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1. A RAT EXPERIMENT 
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The rat Su must choose one of two rooms. Five seconds after 
a warning bell the rat is either rewarded (fed) or punished 
(shocked) by the experimenter Ex, and Su does not see what the 
result would have been had Su chosen the other room on that 
particular trial. 

Ex rewards or punishes according to a rule prescribed in 
advance. The experimental situation may be summarily de- 
scribed as a two-person game. The payoff matrix for Su is: 



Su 



Ex 



Places food in: 



Room 1 



Room 2 



Sits Room 1 
Room 2 



1 

-y 



-y 
i 



At the moment, we are not interested in the payoff matrix for 
Ex but shall simply suppose that Ex has chosen a strategy (n\, 712) 
such that Ex places the food in Room i lOOn^ per cent of the 
time. The expected payoff for Su is then V(pj) = pj(27Tj-l)(l^y) + 
(l-TTj-yTTj), where y is non-negative and pj denotes the propor- 
tion of the time Su sits in Room i. Now in this situation, even if 
Su had superior human intelligence, game theory would give Su 
little real help in choosing a strategy because there is no mean- 
ing to attach to the notion of payoff matrix for Ex. Even if there 
were a payoff matrix for Ex, the game would probably be non- 
constant sum and the value of y would vary from Su to Su as 
well as from time to time, and be difficult to estimate. Never- 
theless, a rat or a human found in this situation does behave in 
some fashion, and our scientific problem is to explain and pre- 
dict actual behavior as well as possible. 

Before turning to the game-learning theory approach, it may 
be instructive to discuss the situation in the usual manner, from 
the standpoint of rationality. For example, if TTJ = 1/2, it does 
not matter what Su does, since the result is independent of his 
choices. If TTJ ^ 1/2, then Su should choose pj = 1 or pj = 
according as TTJ > 1/2 or TTI < 1/2. On the other hand, if Su 
feels that its past behavior (including its biological characteris- 
tics) may be analyzed intelligently by an Ex that strives to mini- 
mize the payoff to Su by choosing a time- dependent strategy in 
terms of past behavior of Su, then Su should somehow protect 
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against this unwanted result by concealing its pattern of behavior 
from Ex (perhaps by randomization as proposed in the theory of 
games). These dynamic cases are entirely outside the scope of 
present formal game theory, of course, and are the principal 
cases of interest here. 

The basic assumption from learning theory is that Su varies 
its behavior according to the pattern of its past experience. The 
special mathematical form assumed here for this effect is the 
special fusion model of section 6, with m = 2. The matrices R^ 
and P 11 are, therefore: 

-c b b \ /1-c b b 

> n - ' c 1-b c 1, R 12 = ( 1-b-c 
1-b-c/ \ c c 1-b 



P 12 





R 1 is used after Room i is chosen when the food was placed 
there, and P* is used after Room i is chosen when the food was 
not placed there. Of course, Ex, as player 2, may be repre- 
sented by the relations: 

r \ 

P 2 (0) =1 it I , and R 2 > = P 2 ' = 

w 

The TTJ correspond exactly to the Gj of section 5. Li this case, 
Ted Harris has shown (see [9]) that the asymptotic value of 
P*(t) is 




1 



e o = 
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whatever the value of p*(0); indeed, the probability is 1 that 
there will eventually be an unbroken sequence of applications of 
P that terminates the process. 

Monte Carlo computations were made for this model with 
numerical values for the parameters, chosen in agreement with 
estimates by Bush and Mosteller [6] on the basis of data from 
learning experiments with rats, as follows: 

a=b=d=0.01, c=0.10. 

These computations were made for a rather careless assortment 
of values for p*(0) and TTJ, and the main results are shown in 
Tables 1A-1G. All the computed cases show a strong tendency 
for p to seek an equilibrium near the value 0.1, and it is in- 
teresting also that P2 seemed always to go to zero when 7ri>0.5; 
this constitutes a tendency toward optimal behavior since only 
choice of Rooms 1 and 2 represent actual decisions by the rat 
under our interpretation of the fusion model. 



TABLE 1 
Stat-rat Strategies 



1A: 



=0.5 



t 


PO 


PI 


P2 





100 


450 


450 


5 


95 


555 


350 


10 


94 


450 


456 


30 


91 


512 


397 


50 


108 


446 


446 



IB: 



= 0.5 



t 


PO 


Pi 


P2 








500 


500 


5 


19 


579 


402 


10 


46 


617 


337 


30 


86 


673 


241 


60 


106 


806 


90 


90 


97 


869 


33 


115 


97 


897 


4 



1C: 



= 0.5 



t 


PO 


Pi 


P2 





900 


50 


50 


5 


895 


58 


47 


10 


900 


53 


46 


30 


900 


52 


48 


60 


692 


281 


25 


90 


355 


374 


271 


120 


193 


472 


335 


150 


111 


589 


299 


180 


139 


722 


139 


210 


97 


762 


141 


240 


98 


866 


36 
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TABLE 1 (Continued) 



ID: TTJ = 0.51 



t 


PO 


Pi 


P2 





800 


100 


100 


5 


717 


196 


87 


10 


717 


202 


80 


30 


398 


337 


265 


60 


337 


497 


165 


90 


148 


606 


247 


120 


116 


815 


69 


150 


98 


893 


9 


180 


115 


623 


262 


210 


98 


371 


31 


240 


88 


906 


6 



IF: 7T! = 0.9 



t 


PO 


Pi 


P2 





900 


100 





5 


735 


265 





10 


744 


256 





30 


593 


406 





60 


126 


874 





90 


106 


894 





120 


90 


908 





150 


111 


889 





180 


92 


908 





210 


98 


902 





240 


101 


898 






IE: 



= 0.55 



1G: 



= 1.0 



t 


PO 


Pi 


P2 





800 


100 


100 


5 


721 


193 


86 


10 


656 


262 


81 


30 


411 


439 


151 


60 


239 


715 


45 


90 


118 


875 


7 


120 


98 


901 


1 



t 


PO 


Pi 


P2 





114 


100 


786 


5 


109 


96 


795 


10 


104 


179 


715 


30 


107 


716 


177 


60 


99 


892 


8 


90 


107 


892 


1 



8, HUMAN SUBJECTS 

All that has been said about the game -learning model is ap- 
plicable in the analysis of experimental data with human subjects 
There may be a considerable advantage in using human subjects 
since the conditions of the experiment can be explained to them 
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easily, and because their choices are made quite rapidly. Some 
very tentative trials were run in order to gain some experience 
with the experimental situation, as a first step toward a design 
of more conclusive trials. 

In the first series the subject Su was asked to call "head" or 
"tail" in an attempt to match the random choice made by Ex with 
fixed probability TTJ. The success of Su was compared with that 
of the special fusion model (stat-rat) used in section 7, where 
random numbers were used to yield TTJ = 0.55. The number of 
trials was too small to permit any quantitative conclusion to be 
drawn, but it seemed likely that a more extensive series of 
trials would be worthwhile. The scheme was discontinued in 
favor of more promising ones to be discussed next. 

The general 3x3 zero-sum symmetric game, which has no 
pure strategy as a solution, is represented by a three-parameter 
payoff matrix for its normal form: 



V = 



where u, v, and w are positive. The solution of this game is 
the unique mixed strategy: 




u+v+w 



(w, v, u) . 



There were three trials in each of which a subject played a 3x3 
zero- sum symmetric game against the stat-rat, as defined nu- 
merically for the special fusion model in section 7. For these 
trials: 

(a) The absolute values for u, v, and w were taken directly 
from a table of random numbers; 

(b) The subject was not told the payoff matrix but was told 
the exact method used to select it; 

(c) The subject was told that he was playing against a rat in 
mathematical form. 

The three payoff functions were: 



Subject 


U V W 


Solution 


RF 


-668 


Col. 2 


MD 


6-87 


Col. 1 


MF 


6 10 -4 


Col. 3 
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These three games each have a pure strategy for a solution, as 
shown in the final column of the table just preceding. The results 
of the three trials are summarized in Table 2. The trend in the 
stat-rat 1 s mixed strategy is shown for each game in Table 2, 
along with an estimate of the mixed strategy in use by the sub- 
ject based on the average of his ten choices centered at the play 
listed. Again the data are too scanty to justify careful analysis, 
or quantitative conclusions, and this type of trial was discontinued 
in favor of a more promising one to be discussed next. 



TABLE 2 
3x3 Symmetric Games 





RF game 


MD game 


MF game 




RF 


Stat-rat 


MD 


Stat-rat 


MF 


Stat-rat 


No. of plays 


19 


19 


25 


25 


20 


20 


No. of wins 


6 


13 


5 


20 


10 


10 


Percentage wins 


32 


68 


20 


80 


50 


50 


Through Play 




Frequi 


mcy of 


Use of S 


jlution 


* 


1 





.33 


__ 


.33 


_ 


.33 


6 


.3 


.45 


.4 


.61 


.8 


.55 


10 


.4 


.58 


.5 


.68 


1.0 


.66 


15 


.8 


.72 


.6 


.70 


1.0 


.70 


20 





.76 


.7 


.74 





.80 


25 











.74 









*For the stat-rat the frequency through play x is computed 
as the ratio of 



and for the subjects it is one-tenth the number of wins in 
the ten plays centered at play x. 
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There are really two rather different types of problems in- 
volved thus far in our discussions of game-learning situations 
in which the payoff functions are unknown: 

(a) (Static) Those situations in which it is assumed that the 
opponents of the main player choose and use a fixed mixed 
strategy for the duration of a sequence of plays, 

(b) (Dynamic) Those situations in which it is assumed that 
the opponents of the main player may vary their strategic 
behavior during the sequence in a manner that somehow 
takes account of the results they obtain on earlier plays. 

The static case, with known payoff functions, is the usual one 
considered [14] whereas our interest centers here on the dy- 
namic case with essentially unknown payoff functions. The 
static case always reduces to one in which there is a set of 
numbers G x , in the closed interval [0, l], that represents the 
payment expectation if our main player chooses pure strategy x 
on a given play and the numbers G x remain constant for the 
sequence of plays; the dynamic case takes the same form except 
that the numbers G x may vary in some manner that is dependent 
upon the choices made by our main player in preceding plays. 
The game-learning model is equally applicable in either the 
static or the dynamic case. 

The game of Morra [7] is a convenient one for our purposes 
both because it is of handy size (9x9) and because it has been 
completely solved. The static case was examined experimentally 
for Morra by having two subjects and the stat-rat play the game 
knowing only that it was 9x9 and symmetric, and that their op- 
ponent would not be using a game-theoretic solution. Subject BC 
had no knowledge of game theory and Subject RB is a mathemati- 
cian who is expert in game and decision theory. Actually, a fixed 
pure strategy, not in the solution mixture, was used in opposing 
the subjects and the stat-rat; it is represented by the following 
set of values for G x that were used against BC and the stat-rat, 
those for RB being 2/3 as great: 

G x = (.500, .500, .833, .250, .250, .500, .500, .500, 1.000). 

The results of play are summarized in Table 3. The data are 
still too skimpy to permit any conclusions to be drawn. 

There is no particular reason, in the static case at least, why 
the experimental values chosen for G x should come from a game 
that has a well-known extended form. Consequently, we have 
come to the following type of experiment as the most promising 
one to use in obtaining data on the behavior of human subjects to 
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be used in estimating parameters in the fusion models and thus 
eventually to test the hypothesis that this mathematical model 
represents human learning behavior. The G x are chosen from 
a random-number table and the subject then is asked to play a 
number of times fixed in advance in an effort to maximize his 
total number of wins; RB and AM, both experts in the relevant 
mathematical theories, served as subjects for trials in which 
there were 1,000 plays and: 

G x = (.04, .03, .25, .61, .64, .33, .44, .44, .75, .41). 



TABLE 3 
Static Morra 





RB 


BC 


Stat-rat 1 


Stat-rat 2 


No. of plays 


67* 


372* 


29 


43 


No. of wins 


24 


213 


14 


21 


Percentage wins 


40 


57 


48 


49 


Through Play 


Frequency of Use of Solution 


1 








.100 


.100 


5 


.1 


.2 


.077 


.186 


10 


.6 


.1 


.058 


.155 


20 


.5 


.1 


.030 


.260 


30 


_ 


.1 


.017 


.304 


60 





.1 








90 





.1 


ii 




120 





.2 


___ 


. 


150 





.2 








210 





.2 


. 


i 


270 


_ 


.1 


.I. 





330 


_ 


.2 


_ 


____ 


360 





.0 









*The subject announced at this point that he would con- 
tinue playing pure strategy No. 9 indefinitely, and so had 
in effect "solved" the game. RB made this tentative de- 
cision after 32 plays and used the other 35 simply to 
confirm his decision. 
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The results are given in Table 4; it seems unlikely that the stat- 
rat would match this performance, but it would probably take a 
good many trials to give a statistically significant test of this 
conjecture. 

One static-nine game has been played by the stat-rat with ten 
replications. In this play, pg = 0.1 for a = 0.1, , 0.9, and 
the values for G x chosen from the random-number table were: 

G x = (.097, .510, .433, .274, .442, .364, .503, .929, .256) 

This was the first IBM machine run and the computations were 
carried to eight decimal places for two hundred steps each. In 
nine out of the ten cases, the value of p200 was essentially such 
that p = 0.9, p = 0.1; in other words, the stat-rat had 
reached the optimum strategy in 200 trials. In the tenth case 
the value of jor 00 , if rounded off at the third decimal place, was 
essentially p| = 0.9, p^ 00 = 0.1; in other words, the stat-rat 
had reached a very poor strategy in 200 trials. Other details of 
this run of static-nine are given in Table 5, including the "win- 
ning rate" w* which represents the expectation on the first de- 
cision after time t, where: 

9 



9 





TABLE 4 
Static -10-Game 





RB 


AM 


No. of plays 


1,000 


1,000 


No. of wins* 


719 


715 


Percentage wins 


71.9 


71.5 


No. of last play before 


133 


204 


deciding permanently 






on strategy 9 







Based on expectation after decision to play strategy 9 
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TABLE 5 
Static-9 Game 





Average 


Game Number 




o v/* 1 






Item 


Game 9 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


Time at which 
























winning rate 
























first exceeded: 
























.84 


59 


41 


29 


58 


88 


98 


65 


72 


58 





26 


.90 


81 


97 


41 


88 


99 


111 


76 


92 


68 





36 


Winning rate 
























at time: 
























50 


70 


87 


84 


73 


44 


58 


62 


59 


74 


36 


92 


100 


91 


90 


93 


92 


90 


82 


93 


91 


93 


36 


93 


200 


93 


93 


93 


93 


93 


93 


93 


93 


93 


36 


93 


Percentage win 


s 






















in: 
























100 decisions 


71 


63 


85 


72 


$7 


57 


75 


65 


75 


38 


86 


200 decisions 


82 


80 


89 


83 


75 


75 


83 


80 


83 


38 


91 


pi 00 


.100 


.106 


.091 


.092 


.106 


.101 


.093 


.104 


.096 


.095 


.110 


p? 


.097 


.090 


.100 


.092 


.105 


.105 


.100 


.092 


.091 


.093 


.098 


No. of thinking 
























steps for 200 
























decisions: 


23 


23 


25 


15 


29 


27 


18 


22 


21 


14 


29 



The dynamic case is perhaps the most interesting one ex- 
perimentally, especially where a fusion model is pitted against 
a human subject. Before going too far with such a program, it 
will be necessary to develop a better mathematical understanding 
of the models in order to design the experiments so as to permit 
statistical significance tests to be applied; this point has been 
discussed by Bush and Mos teller [5], and they and others are 
gradually developing some of the mathematical tools that are 
needed. We have some of these experiments under way with 
human subjects, using Morra and other games of about this com- 
plexity for the purpose. It would be interesting also to run some 
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trials of exactly the same sort with real rats (e.g., playing 
Morra against the stat-rat). 



9. AGAINST THE STAT-RAT 

We have seen how the stat-rat is able to play any game with 
known bounds on the payments, even though we have not been 
able to settle the question concerning its degree of skill at 
games. We shall now be interested in how best to exploit this 
knowledge of the procedure used by the stat-rat in playing games 
when we are its opponent. Of course, if we know the expectation 
functions and have computed the solution of the game, then we 
can guarantee at least a certain minimum result by choosing the 
game-theoretic solution; our object is to do better than this safe 
solution guarantees, and we also should like to know how to play 
when we do not know the expectation functions or the theoretical 
solution. 

As a special case, consider the ordinary game of matching 
pennies.' We start with the reasonable assumption that the ini- 
tial vector for the stat-rat is: 




To make the game quite definite, in our usual notation, we note 
that the game is usually represented by the expectation functions: 



We transform this game into an equivalent one, in the sense that 
linear transformations on the individual utility functions leave 
the solutions invariant, by setting: 

V ili 2 = (? + 1) = *' and 



No chance move is really needed, in this special case, since the 
values of V^ . are all zero or one. Finally, we specify that 



there is to be a sequence of N plays, and our problem is to 
choose a method of play that will maximize our expected pay- 
ments against the stat-rat. Since we can compute the p*(t) for 
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the stat-rat at each stage, except for the steps when p* has 
effect, it is not difficult to find a method of play that gives us an 
average expectation in excess of that obtained if we play strate- 
gies 1 and 2 with equal frequencies. Such a good strategy would 
be for us always to play the strategy that is less likely to be 
chosen by the stat-rat. We shall not pursue this very simple 
example further, except to note that it becomes immediately 
more difficult if we do not know pMO) or if the expectation 
functions are represented in the equivalent form: 

where < 6 < 1. 
2 



ev\ i 
l i l \ 



11. SUMMARY 

This is a preliminary paper. In it we have shown how a 
player can "learn 11 , during the course of a sequence of plays of 
a game, to improve his strategy. The fusion model developed 
by Bush and Mosteller to explain observed behavior of rats in 
experimental learning situations was used as the basis for both 
a theoretical and experimental investigation of the efficiency of 
this type of learning process in learning to play games. The ex- 
periments discussed here were with human subjects, and their 
game- learning performance was compared with that of the "stat- 
rat" represented by the fusion model with numerical values of 
the parameters estimated to fit experimental data for rats. 

The theoretical models accept basic assumptions of von Neu- 
mann- Morgenstern game theory and Bush-Mosteller learning 
theory, including: 

(a) Games with identical normal forms are equivalent, and 
this equivalence is independent of the probability distribution 
functions associated with chance moves. 

(b) Games that differ only by linear transformations of the 
individual payoff functions are equivalent. 

(c) Learning is a Markov process. 

Equivalence here means that the games have the same solutions. 
The experimental results consist of Monte Carlo computations 
for the stat-rat, contests between stat-rat and a human subject, 
and comparisons of performance of stat-rat and a human subject 
when playing the same static game. Very limited data indicate 
that: 

(a) The stat-rat usually learns a good strategy when a con- 
stant mixed- strategy is played against him. In Morra and the 
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other games played the stat-rat seemed to settle on essentially 
the best strategy within 200 trials or so* 

(b) A person proficient at games would win against the stat- 
rat in Morra. 

(c) The stat-rat does reasonably well in a static game, in 
comparison with the human subject, but a statistician would cer- 
tainly defeat the stat-rat. 

More extensive experiments are in progress, and it is hoped 
that these may provide the data necessary to estimate parameter 
values for human subjects and eventually to test the adequacy of 
this type of Markov process for description of human learning. 
It seems very unlikely now that such a Markov process will be 
adequate. 



FOOTNOTE 

1. This process has been most systematized for the art of de- 
cision by military commanders [10], [ll]. 
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CHAPTER XI 

REPRESENTATION OF A PREFERENCE 
ORDERING BY A NUMERICAL FUNCTION* 

by 

Gerard Debreu 
COWLES COMMISSION FOR RESEARCH IN ECONOMICS 

1. INTRODUCTION 

It has often been assumed in economics that if a set X (usu- 
ally in the finite Euclidean space of commodity bundles) is com- 
pletely ordered by the preferences of some agent, it is always 
possible to define on that set a real-valued order-preserving 
function (utility, satisfaction). This is easily seen to be false. * 

The particular case where there exists on X (the set of 
prospects) a certain algebra of combining (corresponding to 
the combination of probabilities) has been rigorously and ex- 
tensively studied by J. von Neumann and O. Morgenstern [7], 
J. Marschak [6], I. N. Herstein and J. Milnor [5]. 

But, rather paradoxically, the general case, which is more 
basic and simpler, has received little attention from economists. 
H. Wold's study [8] indeed seems to be the only rigorous one; 
its assumptions are however restrictive. 

This note gives conditions under which a complete order 



* Based on Cowles Commission Discussion Paper, Economics 
2040 (April, 1952). This article has been prepared under con- 
tract Nonr-358(01), NR 047-006 between the Office of Naval Re- 
search and the Cowles Commission for Research in Economics, 
to be reprinted as a Cowles Commission Paper. 

I am grateful to staff members and guests of the Cowles 
Commission and very particularly to I. N. Herstein for their 
comments. I owe to P. R. Halmos reference [4]. My greatest 
debt is to L. T. Savage who suggested, in the course of a valu- 
able discussion that Cantor's postulate x < zj < y (see Lemma 
II) might be weakened to x = zj = y. 
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(denoted by ) can be represented by a numerical function. 
The most common preference ordering in economics is that of 
bundles of n commodities, i.e., of points of an n-dimensional 
Euclidean space. We shall however treat the problem in a more 
general frame since this involves no additional mathematical 
cost. 

The familiar case of a set in a anite Euclidean space is 
covered by the following proposition which is a very special ap- 
plication of theorem II below: 

Let X be a completely ordered subset of a finite Euclidean 
space. If for every x' e X the sets {x Xlx ^ x'} ,(x X|x f ~x) 
are closed (in X), there exists on X a continuous, real, order- 
preserving function. 

The assumption that the set |x e X|x f $ x} is closed (in X) 
is equivalent to the more intuitive assumption: let (x^) be any 
sequence of points in X having a limit xe X, if for all k, x k 
is at least as good as x v , then x is at least as good as x v . 



2. TWO REPRESENTATION LEMMAS 

A complete ordering on X is, to be precise, a binary rela- 
tion, denoted <, satisfying 

1) Given any two elements x, y of X; x ^ y and/or y ^ x 

2) Given three elements of X such that x ^ y, y ^ z then x S z 

From this relation can be derived two new ones: 

x ~ y (x indifferent to y) if x ^ y and y ^ x 
x < Y (y better than x) if x ^ y and not y ^ x. 

The quotient set X/~ , i.e., the set of indifference classes in X, 
will be denoted by A. * The trivial case where all elements of 
X are indifferent (i.e. where A has just one element) will al- 
ways be excluded. 

The interval [x f , y f ] is the set {x X|x f ^ x^ y f } . 
The interval ]x f , y f [ is the set {x e X|x f < x<y'j . 

A real-valued function 0(x) defined on X is said to be order- 
preserving if x = y is equivalent to 0(x) 1* 0(y). 

A natural topology on X is a topology^ for which the sets 
{xe Xlxl x 1 }, |xe X|x f < x} are closed for all x f e X. 
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Lemma I. Let X be a completely ordered set whose quotient 
A is countable. There exists on X a real, order- preserving 
function, continuous 3 in any natural topology, 

Rank the elements of A; it is clearly possible to construct 
by induction on the rank an order-preserving function taking 

A into some finite real interval. Let X = Inf 0(a), ju = Sup 0(a). 

ae A ae A 

If a f satisfies X<a f < /i and a f { (A), four cases may occur: 
the set {a e 0(A)|a<a f } (1) may, or (2) may not, have a 
largest element; and the set {a e0(A)|a f < a} (l f ) may, or (2 f ) 
may not, have a smallest element. We wish to eliminate the 
gaps of type (l-2 f ), (2-1 1 ) and (2-2 f ); this can easily be done 
by means of a non-decreasing step function @(a), the height of 
each step being equal to the length of the corresponding gap. 
The new function 0*(a) = 0(a) -[0(a)] is still order-preserving 
and <t> *(A) has no gaps of the unwanted types. Denote by a(x) 
the indifference class a to which x belongs; we finally define 
0(x) = 0*[a(x)]. To show that <t> is continuous in any natural 
topology on X consider a number a 1 , X<a f < juand the set 
Xa' = {xeXl *(x)^a'f. 

1) If a' e 0(X), let x f e X be such that a 1 = 0(x f ). 
X0 = |xeX|xS x f [ and is therefore closed. 

2) If a'< 0(X) and if the set R^ = {ae 0(X)|a<a f } has a 
largest element a", X^ 1 = X^" which is closed by 1). 

3) If a f I (X) and if the set Ity has no largest element, 
then the set R** 1 = {a e 0(X)|a f < a} has no smallest element 
since 0(X) has no gap of type (2-l f ). Thus X cr i = Q Xa 

aeR a ' 
and Xgt is closed as an intersection of closed sets. 

Similarly one proves that for any number a 1 the set 
X ' = |xe X|a f ^ 0(x)} is closed. It follows that the inverse 
image by of any closed set of the real line R is a closed set 
of X. 



Let X be a completely ordered set, Z = (Z Q , z\ 9 * ) 

a countable subset of X. If for every pair x,y of elements of X 
such that x < y, there is an element zj of Z such that x j* zj ]Ty, 
then there exists on X a real, order-preserving function, con- 
tinuous in any natural topology. 

The assumption made is a weakening of the postulate 
(x<zi<y) used by G. Cantor in [3], 

Take first the quotient sets X/^ = A and Z/~ = C. C is 
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clearly countable and plays for A the role that Z played for X. 
If A has a smallest and/or a largest element, we can assume, 
without any loss of generality, that they are contained in C. 

Define a new equivalence relation^ among elements of A by: 
aFb if and only if between a and b there is a finite number of 
elements of A, The binary relation F is indeed reflexive, sym- 
metric and transitive. Equivalence classes for F are denoted 
by [a] F , fb] F , - 

Every equivalence class is clearly countable. Moreover an 
equivalence class [c]p containing more than one element of A 
contains an element of C and thus the equivalence classes [c]p 
form a countable set. Summing up, C f the union over these 
classes [c]p, is countable and so is D = CUC ! . 

Construct now on D the function 0*as in the proof of 
Lemma I. 0*is extended from D to A as follows. Let ae A 
and a^ D; the set D a = {de D|d<af has no largest element. 
To see this consider any d f e D a . Since a$ D, a^ C f and there 
is an infinity of elements of A between d f and a, there is 
therefore an infinity of elements of C, i.e. of D, between d' 
and a. Similarly the set D* = {de D|a<d| has no smallest 

element. As a consequence the values Sup *(d) and 

deD a 

Inf *(d) are not taken on. Moreover these two values are 
deD* 

equal since 0*(D) has no gap of the (2-2 f ) type; they define 
0*(a). The function 0*(a), and therefore the function 
0(x) = 0*[a(x)], are clearly order-preserving and, since 
0(X) = 0*(A) has no gaps of types (1-2 1 ) or (2-l f ), (x) is 
continuous in any natural topology on X (the proof is the same 
as for Lemma I). 



3. TWO REPRESENTATION THEOREMS 

Before stating Theorem I we recall two definitions. A (topo- 
logical) space X is separable if it contains a countable subset 
whose closure is X. A (topological) space X is connected if 
there is no partition of X into two disjoint, non-empty, closed 
sets. 

Theorem I. Let X be a completely ordered, separable, and 
connected space. If for every x f X the sets {xe X| x S x y j and 
{x XI x y i x) are closed, there exists on X a continuous, real, 

order-preserving function. 
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This theorem can easily be derived from the results of 
S. Eilenberg [4]. It will be proved here as an immediate con- 
sequence of Lemma II. A much more direct proof could as- 
suredly be given: the motivation for the two lemmas is 
Theorem II, 

Call Z the countable set dense in X and consider a pair 
x T ,y' of elements of X such that x ! <y'. The sets jxe X|x^ x ! f 
and |x X|y f ^ x } are disjoint, non-empty and closed, they can- 
not exhaust X which is connected, therefore the open interval 
]x f ,y ? [ is not empty, and it must contain an element z{ Z. The 
theorem is proved since the topology on X is a natural topology. 

The assumption of connectedness is however very strong. 
We give a second theorem where it is removed at the cost of a 
slightly stronger separability assumption. 

A topological space X is perfectly separable if there is a 
countable class (S) of open sets such that every open set in X 
is the union of sets of the class (S). 

We remark that a separable metric space is perfectly sepa- 
rable, that a subspace of a perfectly separable space is per- 
fectly separable. 

Theorem II. Let X be a completely ordered, perfectly sepa- 
rable space. If for every x* X the sets jxe Xlx * x'j and 
Ixe X|x* jE x} are closed, there exists on X a continuous, real, 
order-preserving function. 

Choose an element in each non-empty set S; they form a 
countable set Z". 

Consider then the pairs a f ,b f of elements of A such that 
a ! < b f and the interval ]a',b f [ is empty. The set of those pairs 
is countable. To see this, associate with each such pair a set 
Sfo 1 as follows: take two elements x f ,y f in the indifference 
classes a f ,b f respectively. The set |xX|x<y f f is open and 
therefore there exists a set S^ in the class (S) such that 
x f S b tC |x X|x<y f }. If a M ,b" is another pair with the same 
properties, Sfc" is different from S^i for one has a 1 <b f ^ a"<b" 
in which case x"e S^M and x"^ S& or a"< b" ^ a'< b 1 , in which 
case x f 6 S^t and x f I S^". The pairs a f ,b f are thus in one-to- 
one correspondence with a subclass of the countable class (S). 
Choose then an element x f in each class a f and an element y v 
in each class b f . All those x f and y f form a countable set Z f . 

Consider finally the countable set Z = Z f U Z"; it has all the 
properties required by Lemma II. Let x,y be a pair of elements 
of X such that x<y. If the open set ]x,y[ is not empty, it con- 
tains a non-empty set S and therefore an element of Z". If the 
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set ]x,y[ is empty, x~x f Z f and y~y f Z 1 . So that in any 
case [x>y] contains an element of Z. 



FOOTNOTES 

1. Consider the lexicographic ordering of the plane: a point of 
coordinates (a',b f ) is better than the point (a,b) if *a f >a" or 
if c< a f =s a and b f > b". Suppose that there exists a real order- 
preserving function o?(a,b). Take two fixed numbers b^<b2 
and with a number a associate the two numbers a i(a) = a (a,bj) 
and o^tejtyj)* To *- wo Different numbers a,a f correspond two 
disjoint intervals [i(a), a 2 (a)] and [ai(a f ), o?2(a f )]. ne ob- 
tains therefore a one-to-one correspondence between the set of 
real numbers (non- countable) and a set of non-degenerate dis- 
joint intervals (countable^ 



2. For definitions relating to an equivalence relation see [l]. 

3. For definitions of a topology and of a continuous function see 
[2, I] and [2, 4] respectively. 

4 The closedness assumptions have already been used in a 
similar context by I. N. Herstein in an earlier unpublished ver- 
sion of [5]. 
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CHAPTER XII 

MULTIDIMENSIONAL UTILITIES 

by 

Melvin Hausner 
RAND CORPORATION 

1. INTRODUCTION 

This paper generalizes the von Neumann and Morgenstern 
theory of utility by omitting the Archimedean postulate. The 
work was done originally at The RAND Corporation in the sum- 
mer of 1951 by N. Dalkey and R. M. Thrall. Refined methods 
were introduced by J. G. Wendel and the author in order to 
simplify the work and extend it to the infinite dimensional case. 

Two distinct concepts enter into a formulation of utility 
theory: the set of prospects, which we shall call the mixture 
space and the ordering or utility on this set. For convenience 
we shall treat the mixture space first (axioms Ml - M5) and 
then introduce the ordering axioms (Ol - O3). The two sets of 
axioms, taken together, characterize a utility space. As an in- 
termediate step, we shall consider a weak utility space, where 
axioms Ol - O3 are weakened. The weakened axioms are still 
strong enough to permit identification of "indifferent" elements 
and still preserve the operations of mixture and order. 

The main result is that a mixture space may be embedded in 
a vector space; a utility space may be embedded in an ordered 
vector space. The last section characterizes ordered vector 
spaces. 

2. ALGEBRAIC PRELIMINARIES 

We now introduce the following notation, to be useful in what 
follows. Let S be any subset of a vector space V over the real 
numbers. We define: 

C(S) = the convex closure of S 

= the set of elements X V which are of the form 
X = ZxiSj where Xj >0, Zxj = 1, Si e S. 

167 
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P(S) = the cone generated by S 

= the set of X e V which are of the form X = 

where xj > 0, Sj c S. 
H(S) = the hyperplane generated by S 

= the set of X e V which are of the form X = 

where Zxi =1, S^ e S. 
V(S) = the vector space generated by S 

= the set of elements X of the form X = ZxjSi where 

Sj S. 

The above sums are taken to be finite. We note that C(S) S P(S), 
H(S) c V (S). 

I. The set P(S) is closed under addition and scalar multi- 
plication by positive reals. Such a set is called a cone. In the 
course of various embeddings to be carried out, it happens that 
it is frequently more natural to embed in a cone rather than in 
a vector space. A cone C may be characterized by the follow- 
ing properties (see 3, Pi - P8 for the complete set of axioms): 

1. C is a commutative semigroup with cancellation under the 
operation +. 

2. C is closed under scalar multiplication by positive reals. 
The usual associative and distributive laws are satisfied. 

If C is a cone, it is possible to embed C in a vector space V 
so that addition and scalar multiplication in C may be extended 
to V and so that C generates V. This embedding is unique and 
may be accomplished by the familiar method of first embedding 
C in a group C f under + (cf. [3], p. 43, where a ring is embedded 
in a field). The group C 1 may be made into a vector space by 
defining 

x(A-B) = xA - xB; x > 
O(A-B) = ; 

-x(A-B) = xB - xA; x > 0, A, B C. 

We may then verify that C f is a vector space V and C = P(C), 
V = V(C). 

II. If S is a convex subset of V(C(S) = S), then P(S) consists 
of elements of the form xA where x >0 and A e S. We seek a 
condition (given that S is convex) that this representation is 
unique. First, suppose the representation is not unique. Then 
X 1 A 1 = X 2 A 2 w here xj ^ X2 or xj = X2 and Aj ^ A 2 - In the 
first case, we may assume (by dividing by xj - x 2 ) that xj - x 2 = 
1. Then = xjAj - X2A2H(S). The second case Xj = x 2 > is 
impossible. Conversely, suppose H(S). Then = SXjAj 
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where Exj = 1. By transposing, if necessary, we obtain 
SyiAj = SziAj where yj > 0, Zi > and Zyi ^ Zzj. This gives 
two different representations of the element SyjAi in the form 
xA, Ae S, since the x may be taken to be Syj and Szi. Thus, 
a necessary and sufficient condition that any element in P(S) be 
represented uniquely in the form xA is that / H(S). 

III. We set down some definitions concerning orderings for 
future reference. 

A relation R on a set M is said to be complete if for every 
pair A,B in M either ARE or BRA holds. A relation < on M 
is said to be a weak order if it is reflexive, transitive and com- 
plete. A relation i on M is said to be a chain order if it is 
antisymmetric, transitive, and complete. If is a chain order 
and we define A < B to mean A ^ B but A ^ B then < is easily 
seen to be asymmetric, transitive and trichotomous (i.e. exactly 
one of A < B, A = B, or B < A holds). Conversely, if < is 
asymmetric, transitive, and trichotomous and A ^ B is defined 
to mean A < B or A = B, then ^ gives a chain order. If either 
of the symbols < or ^ has been introduced we will consider the 
other to be defined as above. We use > and ^ to denote the 
transposes of < and ^ , i.e. B > A means A < B and B A 
means A ^ B. 



3. MIXTURE SPACES 

A mixture space is a set M = |A,B, . . .} which satisfies the 
following axioms: 

Ml. For any A,B M and for any real p, ^ p ^ 1, the p 
mixture of A and B, denoted by ApB, is a uniquely 
defined element of M. 

M2. ApB = B(l-p)A 

M3. Ap(BrC) = (A f B)(p+r-pr)C, [(l-p)(l-r) ^ l] 
p+r-pr 

M4. ApA = A 

M5. If ApC = BpC for some p > 0, then A = B. 

Taking r = and B = C in M3, we have with the help of M4 

ApB = Ap(BOB) = (AlB)pB. 
By the cancellation law (M5) we obtain 
M6. A1B = A. 

If V is a vector space, we may define ApB = pA + (l-p)B. It 
is then easily shown that V is a mixture space. We shall show 
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that this is the most general mixture space: any mixture space 
is isomorphic to a convex subset of a vector space. 

The embedding of M will be accomplished by first embedding 
M in a cone P and then, as indicated in ?2, embedding P in a 
vector space V. We shall be concerned only with the embedding 
in a cone. For convenience, we set down the axioms for a cone 
which were suggested in $2. A cone P is a set |A,B, . . .} sat- 
isfying the following axioms: 

PI. There is an operation + in P such that A + B is a 
uniquely determined element of P, where A,B 6 P. 
Scalar multiplication by positive real numbers x is 
defined so that xA is a uniquely defined element of P. 

P2. A + B = B + A 

P3. A + (B+C) = (A+B) + C 

P4. If A + B = A + C then B = C 

P5. x(A+B) = xA + xB, x >0 

P6. (x+y)A = xA + yA 

P7. x(yA) = (xy)A 

P8. 1-A = A 

Given the mixture space M, we define the set P to be the 
space of all ordered couples (x,A) where x>0, A e M, and we 
formally write (x,A) = xA. Thus xA = yB if and only if x = y 
and A = B. (Later, A will be identified with 1-A and this defi- 
nition of equality simply places M in V in such a way that 
/ H(M). An extraneous dimension is thus introduced.) 

The set P being defined, we now define addition and scalar 
multiplication in P. 

Dl. r(sA) = rs(A) 

D2. rA 4 sB = (r+s)A B. 

r+s 

And we now verify that P is a cone. 
PI. is trivial. 
P2. rA + sB = (r+s)A-j^HB (D2) 

= (s+r)B-~A (M2) 

I TO 

= sB + rA (D2). 

P3. rA + (sB+tC) = rA + (s+t)B-^rC 

S+i 
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Similarly, 

(rA+sB) 

With the help of M3 it may be verified that 

A 
r+s+t s+t 

which proves P3. 

P4. If rA -- sB = r A + tC, then 



Hence, s = t and by M5, B = C. Thus sB = tC proving P4. 
P5. x(rA+sB) = x(r+sXA B) 




xr+xs 

= (xr)A -f (xs)B 
= x(rA) -- x(sB) 
P6. (x+y)rA = (xr^yr)A 

XT* 

(M4) 

= (xr)A + (yr)A 

= x(rA) + y(rA). 

P7. x(y(rA)) = x(yrA) = (xyr)A = (xy) (rA) 
P8. l-(rA) = (l-r)A = rA. 

P is thus a cone. If A e M, we define f(A) = 1 A e P to 
obtain the following lemma. 

Lemma 3.1. There is a function f mapping M into a cone P 
in such a manner that 

1) f is 1-1 into P. 

2) f(ApB) = pf(A) + (l-p)f(B) for < p < 1. 

Proof; Define f(A) = 1 A. Then 1) is a consequence of 
definition of equality in P and 2) is verified by the following 
computation: 
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f(ApB) = 1-ApB 

= (p+(l-p))ApB 
= pA+(l-p)B 



= p(A) + (l-p)f(B). 

As in ? 1, the cone P may be tal:en to be a cone in some 
vector space V which is generated by P. The statement of the 
embedding theorem follows. 

Theorem 3.2. Let M be a mixture space. Then there exists 
a vector space V over the real numbers and a function f map- 
ping M into V such that 

1. f is one-one into V. 

2. f(ApB) = pf(A) + (l-p)f(B), p 1. 

3. f(M) = C(f(M)), i.e., f(M) is convex. 

4. V = V(f(M)). 

5. 0/H(f(M)). 

Proof: Let V be generated by the cone P of Lemma 1. 
Then 1. and 2. are true by Lemma 1. (The cases p = and 
p = 1 are trivial by M6.) To prove that f(M) is convex, observe 
that if A 1 , B ! e f(M), so that A 1 = f(A), B f = f(B), then 
pA f + (l-p)B 1 = f(ApB) f(M) by 2. Hence f(M) is convex. 
Since f (M) generates P (because the elements 1 A in the proof 
of Lemma 1 generate P) and P generates V we have 4. As 
for 5., we need to prove that if xf(A) = yf(B) for x,y >0 then 
x = y and f(A) = f(B). In terms of the cone P of Lemma 1, we 
have xA = yB so that x = y and A = B, proving the result. 

We now prove that the embedding is unique. To do this we 
first prove the following algebraic lemma. 

Lemma 3.3. Let V and V 1 be vector spaces with elements 
x,y,z. . . and x f ,y f ,z f . . . respectively. Let C and C f be con- 
vex subsets of V and V f respectively and let / H(C), / H(C f ), 
V = V(C), V 1 = V f (C). Let g map C onto C f in such a way that 

a. g is one-one from C onto C 1 . 

b. g(pA+(l-p)B) = pg(A) + (l-p)g(B) for 0< p < 1 and 
A,B C. 

Then g may be extended in one and only one way to all of V 
such that 

a 1 , g is one-one from V into V 1 . 
b'. g is linear. 
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(Thus, if X = SXiAi, AI C, then g(X) = 2xig(A i ), and this ex- 
tension of g is uniquely defined and satisfies a', and b 1 .) 

Proof; Obviously there is at most one extension. Let us 
first extend g to P(C), by defining g(xA) = xg(A) where x>0 
and A e Co (Observe that since C is convex, and / H(C), xA 
is the most general element of P(C). There is no question of 
whether g is properly defined, since the representation xA is 
unique. 

First, g is one-one onto P(C'). g is one-one since if 
xg(A) = yg(B) with x,y >0, A,B e M, we have g(A), g(B) e C f 
and two representations of an element in P(C ! ). Hence x = y 
and g(A) = g(B), and therefore A = B. That g is onto P(C f ) is 
clear. 

Next, g is additive on P(C). For we have 

g(xA+yB) = 



= (x + y)[g(A)--g(B)] (by b) 



= xg(A) + yg(B) 
= g(xA) + g(yB). 
Finally, g is homogeneous. For, 

g(x(yA)) = g(xyA) 
= xyg(A) 
= xg(yA). 

We now extend g to V. Let R, S, T, U, . . . denote elements 
of P(C). The most general element X of V is of the form 
X = R - S. Define g(x) = g(R) - g(S). g is properly defined, 
since ifR-S=T-U then R + U = S + T, g(R) + g(U) = g(S) + g(T) 
(by additivity of g on P(C) ) and hence g(R) - g(S) = g(T) - g(U). 
We now prove a 1 and b f . 

It is easily verified, from the additivity and homogeneity, that 
g is linear. To prove that it is one-one, assume g(X) = 0. Let- 
ting X = R - S, we have g(R-S) = 0, g(R) = g(S). Hence, since g 
was shown to be one-one on P(C f ), R = S and S = 0. Finally, g 
clearly maps V onto V f and this proves the lemma. 
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Theorem 3.4 (Uniqueness). Let M be a mixture space and 
let V and V f be vector spaces. Suppose f maps M into V and 
F'maps M into V f as in Theorem 3.1. "Then V and V 1 are 
isomorphic under a mapping g which sends an element X = 
Zxjf (Aj) (Aj M) into g(XyTZxjf f (Ai). 

Proof: By Lemma 3.3 applied tu the function g = ff*. 

The above discussion has not depended on the dimension of V 
If the (essentially unique) V of Theorem 1 is (n+1) -dimensional, 
then we say that the mixture space M is n-dimensional. In this 
case, M is simply an n-dimensional convex set. 



4. UTILITY SPACES 

A utility space is a mixture space M with an order relation 
^ imposed on its structure. In accord with 2 3, we assume that 
M is a convex subset of a vector space V with V = V(M), 
i H(M). The relation ^ is required to satisfy the following 
axioms: 

01. The relation ^ is a chain order on M. 

02. If A < B and < p < 1, then pA + (l-p)C < pB + (l-p)C. 

Observe that the converse of O2 holds since < is a trichotomous, 
asymmetric ordering. 

Before considering utility spaces, we shall consider weak 
utility spaces. These are mixture spaces with an ordering < 
satisfying 

Wl. The relation < is a weak order on M. 
W2. If A< B then ApC < BpC for 0< p< 1. 
W3. If ApC < BpC for some p such that < p< 1, then 
A<B. 

Definition 4.1. A - B if and only if A< B and B < A. 

It is easily verified that this is an equivalence relation. Let 
[A] be the equivalence class containing A, i.e., X e [A] if and 
only if X ~ A. 

Definition 4.2. [A] ^ [B] if and only if A < B. 

It is then seen that the relation ^ on the equivalence classes 
is a chain order. Let M f be the set of equivalence classes. We 
make M f into a mixture space by defining 

[A]p[B] = [ApB]. 
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With the help of W2, it is easily verified that this definition is 
unique. That M f is a mixture space follows from the fact that 
M is one. 

To prove O2, assume 0<p<l and [A]<[B]. Since A^B 
we have pA + (l-p)C < pB + (l-p)C. We need only prove that 
pB + (l-p)C pA + (l-p)C. But assume that 

pB + (l-p)C<pA + (l-p)C. 

By W3, we would have B < A, hence A ~ B, [A] = [B] which 
contradicts [A]<[B]. Hence O2 is verified. 

We see that W2 was needed to define mixtures, W3 to define 
order. 

Given a weak utility space M, we identify elements by the 
above process to obtain a utility space M 1 . Any information ob- 
tained on M 1 will then be reflected in M. For this reason we 
consider only utility spaces. 

An ordered vector space V is one which satisfies the follow- 
ing axioms: 

VI. There is a chain order on V 1 . 

V2. If A >B then A + C> B + C for A,B,C, e V. 

V3. If A >B and x>0 then xA >xB. 

An ordered vector space is thus a utility space. Axioms 01 
and O2 are easily seen to hold. The purpose of this section is 
to embed a utility space in an ordered vector space. Since the 
utility space M is embedded in the vector space V as a mixture 
space, it suffices to extend the order relationship from M to V. 
We assume that V = V(M) and i H(M). The order relation will 
first be extended to P = P(M) and then to V = V(P). 

To extend the order on M to P = P(M), we define 

xA > yB if and only if x > y 
or 

x = y and A >B for x,y > 0; A,B M. 

We now verify VI, V2 and V3. 

VI is trivially verifiable. 

V2. Assume xA >yB. We must prove that xA + zC >yB + zC. 

But xA + zC = (x+z) A-^-C and yB + zC = (y+z)B-^-C. If x >y, 

x+z y+z 

we have the result, since x + z>y + z. Ifx = y, then x + z =y + z 
and since A >B, we have AC >B-2L-C by O2. Hence 



xA + zC> yA + zC in any case. 
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V3. Let xA > yB. Then the condition rxA > ryB is equivalent 
to the condition xA>yB. This proves V3. 

Finally, the order on P will be extended to V by defining 

A-B>C-D if and only if A + D > B + C for A,B,C,De P. 

It is easily seen that the definition is unique and we omit this 
proof. We now verify the axioms VI, V2 and V3. 

VI is trivially true. 

V2. Let A - B > C - D. We must prove that 

(A-B) + (E-F) > (C-D) + (E-F) for A,B, . . . F e P. 

But we have A + D >B + C. Hence A + D + E + F>B + C + E + F. 
By definition, we have (A+E) - (B+F) > (C+E) - (D+F). Hence, by 
rearranging, we have the result. 

V3. If A - B >C - D and x > 0, we have 

A + D >B + C, 

xA + xD >xB + xC, 

xA - xB >xC - xD, 

x(A-B) >x(C-D). 

Thus, the utility space M is embedded in an ordered vector 
space V. As we have mentioned before, an extraneous dimension 
has been introduced since { H(M). The dimension of M has 
been defined as the dimension of H(M). Once the above embed- 
ding has been carried out it is an easy matter to embed M in a 
vector space of its own dimension. To do this we simply embed 
H(M) D M in a vector space by selecting any point A e H(M) and 
defining f(X) = X - A for X e H(M). The image of H(M) is then 
a vector space whose dimension is H(M). Moreover, f is one- 
one and f preserves convex combinations and order. This 
seemingly roundabout method of introducing an extra dimension 
greatly simplifies the embedding procedure since all of our em- 
beddings seem to be accomplished easily by first extending defi- 
nitions to P(M) where i H(M). Lastly, it should be pointed out 
that the "proper" embedding should be thought of as an embed- 
ding in an affine space, since the relations of p-mixtures and 
order are preserved under the affine group. We use a vector 
space for convenience in manipulation. 
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5. ORDERED VECTOR SPACES 

We now proceed to characterize ordered vector spaces. The 
definition has been given in ?4 by axioms VI, V2 and V3. It is 
convenient to introduce equivalent substitutes for V2 and V3 as 
follows: 

V2 f If A >0, B >0 then A + B >0 
V3' If A >0, x >0 then xA >0. 

We have 

A > B if and only if A - B > . 

Thus the order is determined by the positive elements which we 
denote by V+. V2 ! and V3 ! imply that V+ is a cone. Since the 
order is a total order and is irreflexive, V+ is a maximal cone 
not containing 0. 

We now introduce a definition which reflects the failure of the 
Archimedean property. Let A, B V+ (A,B> 0). Then we say 
that A dominates B if A >xB for all real x >0. We write this 
as A B or B A. This is defined only for positive ele- 
ments of V. The relation is irreflexive and transitive; and 
AB implies A < B Given A and B, if neither AB nor 
B A we write A ~ B: A is equivalent to B. This is the same 
as saying A ~ B if and only if xA < B < yA for some positive 
reals x and y. Again this is defined for A, Be V+. The relation 
~ is seen to be an equivalence relation. We denote by [Al the 
equivalence class containing A. We may then define [A] >[B] to 
mean A B. We then have a total order on the equivalence 
classes, since this definition may be seen to depend only on the 
equivalence classes and not on their representatives. The 
seemingly irrelevant reversal of order is for later convenience. 

Lemma 5.1. If A B and C >0 then A + C B. 
Proof: If x > 0, then A > xB. Hence A + C> xB. 

Lemma 5.2. If A> > Aj,A2, . . . , A n and xj, . . . , x n >0, 
then A xj Aj + . . . + x n A n . 

Proof: If x > we have 

A > nx xjAj 
Summing and dividing by n, we have 

A > x(xj AI + . . . x n A n ) , 
proving the lemma. 
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Lemma 5.3. If A Aj, . . . , A n and x,xj, . . . , x n > 0, then 
xA + XjAj + . . . + %A^ Xfc+iAj^i + . . . + x n A n . 
Proof: By Lemmas 5 1 and 5.2. 

Corollary 5.4. If {Aj } is a set of elements of V+ no two of 
which are equivalent, then the Aj re linearly independent. 

Proof: If there is a linear relation among the Aj, then we 
will obtain a relation of the form 

xA + XjAj + . . . + x k A k = x k +iA k +! + + x n A n , 

where the x's are positive, the A's belong to the given set, and 
A > > Aj, . . o , A n . But this contradicts Lemma 5.3. 

As usual, we define | A | = 1 A according as A is non-negative 
or negative. The usual laws apply: 

|A +B| i |A| + |B| 

IxAl = |x||A|. 
With this notation we may state the following important lemma: 

Lemma 5.5. Let A ~ IB I . Then there is a unique real num- 
ber x such that xA = B or |xA - Bl A. 

Proof: The uniqueness of x is immediate, for if xj ^ \2 
we have 

|(x 1 -x 2 )Al $ |xjA - yl + |x 2 A - y|, 

and if the terms on the right were zero or dominated by A we 
should have A A. 

To prove that such an x exists, we assume that B > with 
no loss in generality. Since A ~~ B, we have yA< B < zA 
for some positive real numbers y and z. Let x be the su- 
premum of the numbers y for which yA< B. Let >0 be an 
arbitrary positive real number. Then we have 

(x-)A<B<(x+)A, 
-A<B - xA< A, 

IB - XA|< A. 

Then either B - xA = or | B - xA| A by definition. 

We may now easily characterize finite dimensional ordered 
vector space. The result is known (see p. 240, [l]) but we give 
it here as an illustration of the method used for the general (in- 
finite-dimensional) case. 
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Theorem 5.6. Let V be a finite dimensional ordered vector 
space. A basis Aj,A2, . . . , A n may be chosen so that the 
ordering in V is lexicographic, i.e., so that 



= l 



A = li A i >0 
i=l 

if and only if the first non- vanishing Xj is positive. 

Proof: Let V+ be decomposed into equivalence classes as 
above. For each equivalence class t, let A^ be an arbitrary 
element of it. By Corollary 5.4, the set { Aj } is finite. We 
choose the notation so that Aj, . . . , A^ are the representatives 
and AI A2 . . . Aj c . Moreover, k ^ n = dimension of V. 

Let A V. If A ^ then |A I belongs to some equivalence 
class, say |A| ~ A. By Lemma 5.5, either A = xjA or 



| A - xjA^-J < < AJ-. We repeat the process on A - xjA^- if it 

is not zero and continue until the zero element is reached. (It 
must be reached in ^ k steps.) Thus A is a linear combination 
of the Aj's; and since A was arbitrary it follows that the A's 
constitute a basis for V. Thus k = n. 
Now let 

A ^ 



By Lemma 5.3, A >0 if and only if the first non- vanishing co- 
efficient Xi is positive. This completes the proof. Observe that 
a basis Aj, . . . , A n is a lexicographic basis if and only if 
AI > and A! . . . A n . 

Theorem 5.7. Let Aj, . . . , A n and A\, . . . , A^ be basis 
elements in the sense of Theorem 5.6. Then A- = TAj where T 
is a lower triangular matrix with positive diagonal elements. 
Conversely, if T is such a matrix and Aj, . . . , A n constitute 
a lexicographic basis then so do TAj, . . . , TA n . 

Proof: We have Aj ... A n and A^ ...> >A' n . 
Hence Aj ~ A\. Thus A\ = XjAj + terms dominated by AJ by 
Lemma 5.5. A\ = XjAj + x^jA^j + . . . where xj > 0. This 
proves the first part of the theorem. For the second part, we 
observe that TAj ~ A t . Hence TAj ... TA n proving 
the second part. 

In terms of coordinates, we may state Theorem 5.6 as follows: 
With respect to some basis, the vector X = (xj, . . . , x n ) > if 
and only if Xj > or xj = 0, X > 0, . . . , or xj = . . . x n-1 = 0, 
x n >0. 
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In order to consider the infinite dimensional case, we define 
a lexicographic ordered vector space as follows: Let T be a 
totally ordered set. Let V T be the set of all real valued func- 
tions on T which vanish except on some well ordered subset of 
T. We define f >0 if f ^ and if f^) >0 where t o is the first 
t for which f(t) ^ 0. It is easily verified that V T is an ordered 
vector space. For T finite, we get the finite-dimension ordered 
vector spaces. It may then be shown that the ordered set T is 
order- isomorphic to the ordered set of equivalence classes in 
VT- If V is any ordered vector space, we let T be the set of 
equivalence classes with the previous definition of order. Then 
the result is that V is embeddable in a subspace of VT which 
contains the characteristic functions of points. The proof is by 
transfinite induction. The details appear in [2]. 
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CHAPTER XHI 

APPLICATIONS OF 
MULTIDIMENSIONAL UTILITY THEORY 

by 

Robert M. Thrall 
UNIVERSITY OF MICHIGAN 



In the preceding chapter a set of axioms was given for a 
(non- Archimedean) utility space. This chapter is a continuation 
and amplification of the preceding one. In the first part the axi- 
oms are studied with special attention to the consequences of 
weakening certain of them. In the second part several applica- 
tions of non- Archimedean utilities are presented. 



1. DISCUSSION OF THE MIXTURE SPACE AXIOMS 

We recall that a mixture space is a set M = {A, B, . . . } of 
elements called prospects, which satisfies the following axioms: 

Ml. For any A, B e M and for any real p, ^ p ^ 1, the p- 
mixture of A and B, denoted by ApB, is a uniquely 
defined element of M. 

M2. ApB = B(l - p)A 

M3. Ap(BrC) = ( A p JLpr' B)(p * r " pr)C 

M4. ApA = A 

M5. If ApC = BpC for some p > 0, then A = B 

The closure axiom Ml is clearly essential for any mathe- 
matical development. In interpreting this axiom in the real 
world we conceive of ApB as being the prospect of being given 



*Most of the results described in this paper were obtained under 
Contract Nonr: 374 (00) with the Office of Naval Research, but 
some of them (especially in section 3) spring from work done at 
RAND in the summer of 1951. The author wishes to acknowledge 
the contribution made by Dr. Norman Dalkey of RAND Corpora- 
tion during a series of informal discussions on the general sub- 
ject of utility theory. 
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exactly one of the propects A and B, with the probability p of 
getting A. Thus, automobile 1/100 candy bar could be realized 
by a ticket in a 100-ticket lottery in which there was to be one 
automobile prize and 99 candy bar prizes. This lottery inter- 
pretation fails if p is an irrational number. For example, if 
p = I/TT then ApB cannot be interpreted as a lottery prize. 
Again, although one can construct lotteries for such prospects 
as A .55555 B and A .55556 B, it is questionable whether an 
individual could distinguish between them in any natural setting. 
Clearly, in any real-world case there is a limit to discrimination 
just as there are natural limits to the accuracy of measurement 
of length. Limitation of discriminatory capacity cannot be any 
more serious as an objection to the use of real numbers p in 
this theory than it would be in the theory of physical measure- 
ments. However, in applications of this theory one must take 
care to avoid using conclusions which depend essentially on 
exact discrimination. 

If in any interpretation axiom Ml is accepted then axioms 
M2 and M4 are relatively uncontroversial. Axiom M3 is a 
kind of associative law which will certainly be valid in any ap- 
plication if the judge proceeds according to actuarial principles. 
However, its truth in a psychological sense is far from evident. 
It would seem desirable, therefore, to test it experimentally to 
see at least how the errors involved in using it in a real world 
situation compare with errors that come from discriminatory 
limitations. 

A recent experiment, reported in [2], seems to indicate that 
evaluation of a prospect ApB depends not only on the value of p 
but also on the manner of presentation. For example, consider 
two punch boards each with one winning place, the first having 
two holes and the second having twenty holes. The probabilities 
of winning with one punch on the first board and of winning with 
ten punches on the second are both one-half, but these two pre- 
sentations of p = 1/2 are not considered equivalent by all sub- 
jects. This experiment indicates a definite limitation to the 
range of applicability of axiom Ml. 

Axiom M5 is a cancellation law and is included for logical 
clarity. The point at issue here is not whether the judge is in- 
different between ApC and BpC, but is rather what one means 
by distinct prospects. Its role is logical rather than psychologi- 
cal and its acceptance entails no psychological assumptions. 
Axiom M5 serves to "glue" the space together. Without M5 
we could have the following possibility. Let M consist of three 
prospects A, B, C where A = pencil, B = apple, C = nothing. 



MULTIDIMENSIONAL UTILITY THEORY 183 

Now set ApB = BpC = ApC = BpC = C for all p with < p < 1, 
and define X1Y = YOX = X for all X,Y in M. This system satis- 
fies Ml, M2, M3, M4, but not M5, and might even arise in prac- 
tice for an individual who would evaluate as worthless all mix- 
tures involving two distinct prospects. However, we can hardly 
expect a mathematical utility theory to explain such behavior. 

In summary, so far as applications to the real world are con- 
cerned, the crucial mixture axioms are Ml and M3; in both of 
these the psychological content is open to question and, therefore, 
also to experimental study. It will possibly turn out that these 
axioms are better fitted for analysis of command level group 
judgments than for that of individual judgments. 



2. THE ORDER AXIOMS 

Wl. The relation is a weak order on M. (Cf. Chapter XII.) 
W2. If A < B then ApB BpC for < p < 1. 
W3. If ApC < BpC for some p such that 0<p <1, then 
A < B. 

If we have both A < B and B < A we write A ~ B and say 
that A is indifferent to B. Indifference is an equivalence rela- 
tion on M. 

In many ways Wl is the order axiom most open to question. 
A mixture space has an infinite number of elements and the judge 
is asked to make judgments of preference or indifference between 
each pair of these elements, and these judgments are to be tran- 
sitive. A prospect A is said to be pure if it is not the mixture 
BpC, < p < 1, of any two other prospects B and C. If the mix- 
ture space is generated by a finite set of pure prospects one has 
more hope concerning the possibility of imposing an order rela- 
tion. [A set S of prospects is said to generate the entire space 
M if for each prospect C in M there exist probabilities 
Pl> Pr anc * elements Aj, . . ., A r in S such that C = 

A lPl ( A 2P2< ( A r-lP A r) "'))] 

From the practical point of view, if the number of judgments 
needed is finite but large, there is still the time difficulty. By 
the time the judge has reached the 1,000,000th choice, his stand- 
ards of comparison are almost certainly not the same as initially. 
The theory calls for instantaneous and simultaneous judgments 
between all pairs, and applications should be limited to cases 
where there is some evidence that the judgments used have been 
based on reasonably constant standards. 
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In a way, axioms W2 and W3 help simplify the judgment 
problem; in principle, they show how to reduce greatly the num- 
ber of judgments required. For instance, if M is generated by 
a finite number of pure prospects one can get by with a finite 
number of judgments. However, in spite of their obvious im- 
portance, there are some objections to these axioms. 

For example, consider the prospects A = win fiVe dollars, 
C = lose five dollars, B = neither win nor lose. This is a one 
dimensional case, and so utility theory requires a p for which 
B ~ ApC. (This p need not be 1/2.) Consider an individual who 
has a negative utility for gambling and who accordingly has, say, 
B ~ A .6 C. If W2 and W3 hold, then we must have 

D = ApB ~ E = Ap(A .6 C) 

whereas the individual might prefer D to E since D involves 
no chance of losing and still has some chance of winning. This 
type of behavior would follow from a lack of symmetry between 
"good" and "bad" prospects. Such a lack of symmetry has been 
apparent in some experiments run by Ward Edwards, reported 
in [1]. 

On the other hand attempts to weaken W2 and W3 lead to 
curious theoretical results. We discuss one of these. 

Let M be a mixture space with a weak order < , and with the 
only connection between order and mixture being the requirement 
(Qj) A < B, and < p < 1 imply A < ApB < B. This axiom has 
the effect of giving a definite order sense to each line in which 
there exist at least two elements A and B with A< B. How- 
ever, this axiom is fulfilled in the following example. Let M 
be two-space and let ^ be defined by A = (aj, a 2 ) ~B = (bj, b2) 
if (i) a 2 < b2 or, if (ii) a 2 = b 2 and ai ^ bj if a 2 is rational, or 
aj bj if a 2 is irrational. Then any line of non-zero slope has 
its increasing direction upwards, but for horizontal lines the in- 
creasing direction is left-right or right-left, according as the 
distance of the line from the x-axis is rational or irrational. In 
particular, it can happen that A< B but ApC >BpC. This might 
well be described as an "irrational" utility ! 



3. NON- ARCHIMEDEAN UTILITY 

The addition of the following (Archimedean) axiom: 
W4. If A < B < C there exists p such that B ~ ApC 
limits utility spaces to one dimension, i.e., to the real numbers. 
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Although dealing with the real numbers is convenient, there are 
some situations in which axiom W4 does not seem to hold but 
for which higher dimensional utilities are satisfactory 

For example, if A = "be hanged at sundown/ 1 B = "be given 
1 common pin," C = "be given 2 common pins," it is clear that 
A < B < C (assuming that pins are of some use). It is not un- 
reasonable to suppose that there are some individuals for whom 
W4 fails to hold in this case. However, a two dimensional utility 
will fit this case if we let the dominant component describe the 
probability of being hanged and the subordinate component refer 
to the relative probability of one pin versus two pins. In more 
detail, we assign to A, B, C, respectively, the coordinates (0,0), 
(1,0), (1,1). Then the utility space can be considered as the tri- 
angle with vertices A, B, C. If D and JE_ are two prospects, we 
have D<JMf E is closer to the side BC, or if the directed line 
segment DE is parallel to and has the same sense as BC ; i.e. 
if D = (a,b) and E = (c,d) we have D<E if a<c or if a = c 
and b<d. 

Another example arises in appraising various alternatives in 
a military situation. The commanding general must give top 
priority to not losing the war; subject to this priority he tries 
to conserve manpower; and, finally, other things being equal he 
attempts to conserve supplies. This could be described by a 
three dimensional utility. 

For a further example, consider a social group in which a 
certain action A is taboo, a second action B is neither good nor 
bad, and a third action C is moderately good. Thus A < B <C, 
but it is extremely unlikely that W4 holds. 

On the other hand, if one is dealing with a purely economic 
situation in which axioms Ml - M5 and Wl - W3 hold, it is 
quite likely that W4 will hold also. Hence, for much of eco- 
nomics one can probably safely neglect non- Archimedean 
utilities. 



4. CALCULATIONS WITH NON- ARCHIMEDEAN UTILITIES 

Suppose that a commanding general is using a two dimensional 
utility to evaluate a military situation. If he has n basic strate- 
gies PI, . . . , P n and his enemy has m basic strategies 
Q!> i Qm> he must first eva *uate the outcome if he chooses 
Pi and the enemy chooses Qj. This evaluation will be an or- 
dered pair of numbers (ay, by) where, say, the first component 
is the dominant one. If we assume that the general is using game 
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theory, his action will be to choose some mixed strategy. If he 
chooses the mixed strategy which assigns probability PJ to Pj, 
i.e , if he chooses a probability vector 77 = (PJ, . . . , p n ) then 
his expected outcome will be at least as good as the vector 
f(?r) = min Sp^(a^j,b;g). Hence, his task is to choose a proba- 
bility vector zr so as to maximize I(TT). Since his order relation 
is lexicographic, he first finds those vectors TT for which the 
first component fj(7r) = min Ep^a^ of f(?r) is a maximum. Ac- 
cording to the general theory of games these vectors form a 
non-empty convex subset S of the set of all probability vectors. 
If S is a one element set his problem is solved, but if S is 
larger than one element he now must search for vectors TT in S 
which maximize the second component t2(ir) = min Spx^xj of *(*) 

Again the theory of games guarantees a solution. If instead of 
two dimensions the general had three or more dimensions, he 
would merely iterate this process. 

This discussion illustrates the fact that non- Archimedean 
utilities. are perfectly satisfactory for game theory. The equiva- 
lence of game theory and linear programming guarantees that 
non-Archimedean utilities will be satisfactory also for linear 
programming problems. 
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CHAPTER XIV 

TOWARDS AN ECONOMIC THEORY OF 
ORGANIZATION AND INFORMATION' 

by 
Jacob Marschak 
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INTRODUCTION 

The statement of Laplace [9] that the theory of probabilities 
is at bottom merely common sense reduced to calculus 1 and the 
saying of Schumpeter [18] that, to a layman, economics must 
appear halb unverstaendlich, halb selbstverstaendlich, have 
similar meanings. Both men said, in effect, that their respec- 
tive disciplines formalized certain concepts and propositions of 
pre-scientific practice. Surely this did not lead them to think 
that their disciplines were useless In fact, both disciplines 
have been fairly successful. 

The present paper is conceived in the same spirit; and it 
happens to touch on both economic theory and probabilities. It 
can be interpreted as extending the economic theory of a ration- 
al firm into the domain of uncertainty, giving special regard to 
the character of a firm as a team. All criticisms leveled 
against economic theory can therefore be also raised against 
the type of work represented by this paper. In particular, in 
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concentrating on the typically "economic" question of the most 
efficient rules of action and communication for group members, 
I have pursued a line different from those who are interested in 
the description of existing modes of behavior within human 
groups. Yet, I do feel the two lines of research will meet some 
time, and that communication between the two types of workers 
is fruitful. When time comes for second approximations of the 
economic, normative approach, limitations on human rationality 
emphasized by descriptive workers will have to be properly 
formalized and introduced into the theory as a factual constraint. 
On the other hand, the criterion of efficiency of a team or of a 
foundation should be helpful in describing those human groups 
which have been created for a purpose: just as this same cri- 
terion of efficiency is a major guide in describing human arti- 
facts such as machines or ships. The history of ships is no 
doubt fascinating and full of suggestions. But to buy or build a 
ship, it is also useful to know what makes a ship safe and fast. 
There is a history of political constitutions. There is also the 
question, What makes a constitution suitable for its task ? 
There are many histories of business corporations. There is 
also the question, Which structures and procedures make a 
business organization profitable ? 



1. TEAMS, FOUNDATIONS, COALITIONS 

1.1. Preference orderings. Human groups can be classified 
according to whether a group goal exists and according to the 
relation between the group goal and the goals of group members. 
A more formal and convenient phrase for "goal" is "preference 
ordering of states of the world;" or simply "ordering," or "in- 
terests." The relation s Gj s* will be read: "from the point of 
view of i, the state s is at least as good as the state s f ." The 
binary relations Gj, 1=1 f 2, ,n will denote the orderings by 
individuals (individual interests), and G o the ordering from the 
point of view of the group (group interest). Groups will be 
classified according to which combinations of the following 
propositions are valid (S will denote the set of all states): 

A. For every i (1=1, , n), there exists a complete order- 
ing Gi on S. [that is, every group member has a prefer- 
ence ordering]:^ Rationality of members. 

B. There exists a transitive ordering G o on S. [That is, 
the states can be ordered for the group, at least partially: 
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see footnote to A]: Transitivity of group interests. 

C. For any s, s f m S , if s GI s f for all 1=1, , n then 

s GO s'. [That is, if s is not worse than s f for any mem- 
ber of the group, it is not worse than s ! for the group it- 
self]: The so-called Pareto optimality principle, 

D. For any s, s f in S, s GO s' or "s f G s.' [That is, for no 
pair of states is it impossible to say which is preferable 
for the group, unless the two are equally desirable]: Com- 
pleteness of group preferences, (Note that B and D imply 
together complete ordering.) 

E. For all i=l, , n, s GI s' if and only if s G o s f . [That 
is, all individuals have identical interests among them- 
selves and with the group]: Solidarity. 

If A is satisfied we call the n group members rational. 
If A, B and C are satisfied the group is a coalition, 
If A, B, C and D are satisfied we call it a foundation. 
If A, B, C, D and E are satisfied we call it a team. 

1.2. Utility functions. The same concepts can be defined 
(somewhat more narrowly 3 ), if the individual complete ordering 
GI (i=l, , n) is replaced by a numerical function ui on S, 
the so-called satisfaction or utility function. The group mem- 
bers are called rational if for every i=l , , n there exists a 
numerical function ui(s) such that ui(s) u^s 1 ) if and only if 

s GI s f . The corresponding group utility function u o (s), if it 
exists, may or may not be a numerical one. If u o (s) is numeri- 
cal, then UQ(S) = u (s f ) if and only if s G o s 1 . In the case of 
coalitions other than foundations, u o (s) is the vector [uj(s),- , 
u nf s )J (vectors are partially ordered). In the case of founda- 
tions, UQ(S) is numerical and is a monotone non-decreasing 
function of uj(s), U2(s), , u n (s). In the case of teams, UQ(S) = 
ui(s) =. . -=u n (s). 

1.3. Incentives; Bargaining. To sum up: an individual is 
rational if his preferences are consistent. A group of such in- 
dividuals is a coalition if, whenever a state is worsened for one 
individual and is unchanged for all others, it is considered as 
worsened for the group. A coalition is a foundation if all states 
can be compared from the point of view of group interest. A 
foundation is k team if group interest coincides with the interest 
of each member. 

A team may be regarded as the simplest form of a group. If 
a group is a foundation but not a team, the group has a goal 
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separate from the individual goals. This gives rise to the prob- 
lem of individual incentives that would best serve the group goal. 
If a group is a coalition but not a foundation, there is no "group 
interest' 1 to help in making all choices: when any state is 
reached where no member can be made better off without mak- 
ing another member worse off, further choices are determined 
by bargaining, and problems of "relative power" arise. If a 
group of rational men is not a coalition, there arises the gen- 
eral problem of a (generally non-constant-sum) game. 

1.4. Some recent literature. In recent literature the paper 
of HerberTSimon [20J on the Employment Relationship, dealt 
with a coalition (in our sense) between an employer and a work- 
er. The work of Bavelas [l] and his group deals with foundations: 
a group goal exists (some task is to be achieved in shortest time 
with least error, etc.) but there need not be complete solidarity; 
some of Bavelas 1 mathematical problems (to find the minimal 
number of messages needed to fully equalize the state of in- 
formation among n members) are team problems. In von Neu- 
mann- Morgenstern's Theory of Games [16], coalitions have the 
same meaning as here; foundations do not seem to occur; teams 
occur, e.g., in the case of a pair of bridge partners. The pro- 
cess of bargaining within a coalition, and the process of forming 
and dissolving coalitions within a general game is not treated in 
the Theory of Games. A theory of bargaining has been presented 
by J. Nash, |13J, |13a]. It is possible that such problems require 
postulates additional to that of the existence of consistent indi- 
vidual preferences. Such postulates were developed in models 
such as R. Bush's [3] and N. Rashevsky's [15]. 

This paper represents a part of the work on teams by the 
author and other members of the Cowles Commission, espe- 
cially Beckmann [2] and Radner [12], [14]. This work included 
also some preliminary attempts to deal with the problem of 
foundations: ([10], Part III; Faxen [6]), but such attempts will 
not be made here. 



2. ACTIVITIES: OBSERVATIONS, MESSAGES, ACTIONS 

2.1. Events and actions. Each state s of the world which, 
as discussed in 1, is compared with other states of the world 
from the point of view of the team's interests, can be regarded 
as depending on two classes of variables: (1) variables that are 
controlled by the n members; we shall call them actions and 
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denote them by aj , , a n ; (2) variables that are not con- 
trolled by the members; we shall call them "state of environ- 
ment" or "external variables", and summarize them by the let- 
ter x. Since the team's satisfaction depends on the state of the 
world s, and s depends on aj , , a n and on x, we can de- 
note the team 1 s satisfaction by 

(2.1.1) u = u(a,x), 

where a = (aj , , a n ). (Because of "solidarity" of team mem- 
bers we have omitted the subscript under u). u is also called 
payoff or gain. The environment x will be considered random, 
with a known distribution function. 

2.2. Activities. To each individual i corresponds an action 
variable a^ Moreover, to each individual i corresponds a set 
Xi of the values of the variable (possibly a group of variables) 
that he can observe, and the value he has actually observed can 
be denoted by x^eXi. For example, xj = economic conditions, 
X2 - political conditions, etc. Thus x = (xj , , x n ). We can 
distinguish three kinds of activities of i: 

1) To make an observation xj on the external world 

2) To perform an action ai upon the external world 

3) To send to a co-member j a message (report) my, i.e., 
a statement about the external world, of the form 



that is, a report states that the observation x^ made by the re- 
porter belongs to a subset Xjj of X|. The report can thus be 
identified as the set Xy. Two extreme cases are: an "exact (or 
most detailed) report" when Xy consists of a single element, 
x ij = l x ih " no report" when Xy =Xi- We can write Xa =Xj. 



2.3. Cost of message. Thus in general a message my is a 
subset of Xi A non-negative number c^m^) will measure the 
cost of the message. We have 

c i({ x i})> CiQty = 0, 

while "inexact messages" (i.e., messages where Xy consists of 
more than one, but not of all elements of Xi ) have intermediate 
cost. In special cases, c^ may be regarded as a non-increasing 
function of the measure of set 
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2.4. Examples. 

1) Let Xi = vector (xj, xp, e.g., x\ = leather price, xj[ ! = 
shoe price. Then a report to j, giving values of x\ and xj ! (so 
that X^ = a point in the x\, x[ f -plane), is more "exact" and 
more costly than one giving the value of x\ only (so that 

a line in the x\, x| f -plane) or of x{' only. 

2) Let Xi = price and let the report X^ state a sub- 
interval a < Xi < ]3; then the report is the more exact and 
costly the smaller - a . 

2.5. Three-phase team. We shall consider three-phase 
teams only: 



Phase I 
Phase II 
Phase III 



Observations made. 
Messages sent and received. 
Actions performed. 



Actual teams may have, of course, many more phases. Mes- 
sages will occur that do not fall under the definition of our Sec- 
tion 2.2.: e.g., reports transmitting a third member's message 
In the present paper, after Phase I, the member i has informa- 
tion Xi . After Phase II his information is the logical product 

( ) /~\ ^ f~\ /**\ imp 

i x if n X li n . . . nx n i. 

2.6 Best rules. We shall be concerned with determining the 
best rules of action and communication. A rule states, for a 
member i, the "response" he should make to a given "stimulus. 1 
In Phase II, the stimulus consists of the observation xi, and the 
response consists in sending the n-tuple of messages, 
(mil 9 ' * ' 9 m in) = m l ( a row ve ctor). The rule for Phase II is a 
function R{ such that 

(2.6.1) m\ = F" 



" 



involving a communication cost Ci(mj), say. In Phase III, the 
stimulus of i consists of the observation Xi and of the n-tuple 
of messages received by him, (m^, , m n i) = mi, a column 
vector. The response is the action a t , and the rule for Phase III 
is a function R^ such that 

(2.6.2) ai^R^x^mi). 

We shall write x, m f , m, a, R , R n for the corresponding n- 
tuples (xj , , x n ), (m'j , , m^) etc., and denote by c(m) 
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the cost incurred when the matrix of messages sent is [my]. 
From now on, we shall call u(a,x) the gross gain of the team. 
The net gain of the team is 

(2.6.3) v = u(a,x) - c(m) = v(x; R 1 , R n ; u,c), 

a quantity depending on x; on the rules that determine m as a 
function of x and a as a function of x and m; and on the func- 
tions u and c that measure the team satisfaction and the cost 
of messages. 

We have assumed that in the opinion of the organization or 
team a probability measure on the set X of the states x of en- 
vironment exists and is known to them. This distribution of x 
will be denoted by F(x). The expected value of the net gain is 

(2.6.4) /v(x; R 1 , R 11 ; u,c) dF(x) = V(R J , R 11 ; u,c,F), 

2 

say. We want to find the rules R 1 = R 1 and R n = ft n that will 
maximize V. Clearly they will depend on the functions u,c and 
F. We want to study this dependence. What properties of the 
functions u,c and F make for more or for less frequent and 
detailed communication between a given pair of members? 
What conditions make it profitable to cut off direct communica- 
tion between i and j ? 

2.8. Maximizing expected utility. In Section 1, a decision- 
maker was called rational if the states of the world at his choice 
were completely ordered (ranked) by his preferences. This or- 
dering was represented by a numerical utility function. In the 
present Section, the preferences must be conceived in a manner 
that allows for the uncertainty of outcome of a decision. The 
decision-maker is pictured as assuming a certain probability 
distribution F(x) of the environment. Therefore the outcome of 
each decision is a probability distribution of events, a "pros- 
pect"; though in a special case the distribution may degenerate 
into a sure event. The set S of 1.1 is to be reinterpreted as a 
set of prospects. A rational decision- maker has a complete or- 
dering of prospects. A numerical utility function can be attached 
to prospects so that u(s f ) = u(s ff ) whenever prospect s f is pre- 
ferred or equivalent to prospect s". That is, the rational man 
chooses that prospect s for which u(s) is a maximum. 

However, the concept of rationality which we shall use is 
stronger than the one just stated. The rational decision-maker 
is defined as being able to represent his preferences by a nu- 
merical utility function possessing the following property. 
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Denote by (sj , , s m ; pj , . . , p m ) a prospect consisting of 
the anticipation of prospects (possibly, sure events) sj, , s m 
with respective probabilities pj , , p m . Then u(si, , s m ; 
Pi , , Pin) > u(s! , , s m ; pj, , p^) if and only if 



In other words: if we regard u(si) , , u(s ) as alternative 
values of a random variable u(s), the choice between two pros- 
pects characterized by the probability vectors { p[ } and { p| f } 
respectively depends on the values of a single parameter of the 
two considered distributions of u(s), viz, on its expectation 
(mean). In this sense, the rational man is defined as maxi- 
mizing expected utility. This definition was used in 2.6 where 
we ask for the rules of action and communication that would 
maximize the team's expected net gain. 

This definition of rational decision under uncertainty goes 
back to Daniel Bernoulli and has been recently much discussed, 
under the impact of certain behavior postulates formulated in 
the Theory of Games. Here we shall not discuss these, or 
similar, postulates in detail. 4 Instead, it will suffice to show, 
in heuristic fashion, a way to assign to one's prospects a utility 
scale possessing the required property; and to claim that a sys- 
tem of preferences permitting such a scale does characterize 
an ideally consistent decision-maker, e.g. an ideal organizer 
and legislator for a team. 

Assume provisionally that there exists for such a decision- 
maker a "worst" and a "best" state of the world. Denote them 
to by s and s^, respectively. Assign them utility numbers 
u(s ) = and u(sj) = 1, and proceed to "calibrate" on this scale 
the utilities of all other states. To begin with, consider a pros- 
pect promising sj with probability TT and s o with probability 
1 - TT. In the notation just proposed above, such a prospect will 
be denoted by (sj, s ; TT, 1 - TT). The decision-maker you, the 
reader, for example will (a postulate is hidden here!) consider 
such a prospect better than s and worse than sj. Therefore 
you will assign to it a utility number between and 1. The 
probability is itself such a number. It is therefore permissible 
to choose a utility function u such that u(sj, s ; TT, 1 - TT) = TT, 
for any n between and 1, including and 1 themselves 
[since u(sj, s o ; 0, 1) = u(s o ) = and u(s lf s o ; 1,0) = l]. 

Consider next some prospect s that is neither s nor sj 
nor the promise of sj or s o with some probabilities. If the 
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decision-maker is indifferent between s and s^, or, alternative- 
ly, between s and S Q , then, of course, u(s) = or 1, respec- 
tively. There remains the case when he considers s better 
than s o but worse than Sj. In this case, by scanning all pos- 
sible prospects of the type (sj, s o ; TT, I-TT), with TT ranging 
continuously from to 1, one will find (a postulate is hidden 
here!) one such prospect that the decision-maker will be indif- 
ferent between it and the prospect s. That is, there will be a IT 
such that u(s) = u(sj, s o ; TT, 1 -TT). And since, on the utility 
scale chosen above, u(si, s o ; TT, I-TT) = TT, we shall have U(S) = TT. 
Thus we have assigned utility numbers to all prospects. 

To show that this utility scale possesses the required prop- 
erty, compute the utility of the prospect s = (tj, t2 , , t m ; 
Pl P2 ' ' i Pm)> where tj , , t m are prospects (possibly 
sure events) and p^ , , p m their probabilities. Let the utility 
of each ti be measured on our scale; that is, u(tj)= TIJ if the de- 
cision-maker is indifferent between t| and (tj, t o ; TTI, 1 - TTJ). 
Then the decision-maker is indifferent between the prospect 
(tl , ' , t m ; PI , , p m ) and the prospect [(sj, s o ; v lt 1 - *i), 

(Sl, S ; 7T2, 1 - 7T2) , , (Sj, S J 7T m , 1 - 7T m ); Pj , -, p m ], 

(actually a postulate is hidden here). But the latter prospect is 
nothing but the promise of s^ with probability pjiri + P2^2 + ' * ' 



m m 

Pm^m = Pi u (ti); ai *d of s o with probability 1 - Z pjuUj). Now, 

on our scale, the utility of such a prospect is simply equal to 

the probability of si . Hence 

m 
u(tj , ', t m ; Pj, , p m ) = E Piu(ti); 

the expression on the right is the "expected utility" of the pros- 
pect (t x , , t m ; pj , , p m ). Hence, if two prospects 



and (*!* * '^mJ Pi 1 *' 'iPm) are 
compared, the preferred one will have the higher expected 

utility. 

The provisional assumption that there exist a best and a 
worst states can be dropped. We can assign utilities 1 and 
to any two states, sj and s o of which the former is preferred 
to the latter; and assign, as before, utility numbers xr(o^7r^l) 
to all prospects which are not better than sj and not worse than 
s O o To assign a utility number to a prospect (say 82) that is 
better than Sj, we first form a number TT (0 < TT < 1) such that 
the decision-maker is indifferent between sj and (52, s o ; TT, 1 - TT); 
and let, accordingly, u(s) satisfy the condition 
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7T.u(s2) + (1 - TT) u(s ) = u(s 1 ), and hence 7r.u(s 2 ) = 1, u(s2) = - >1. 

Similarly, if s 2 is worse than s o , we find a number IT such that 
the decision-maker is indifferent between s o and (sj, s 2 ; v, 1 - ff) 
and define u(s 2 ) by u(s o ) = Tr.ufej) + (1 -?r).u(s 2 ); and hence 
U (s 2 ) = - 7r/(l - TT) < 0. It is easy to see that this utility scale, 
now extended beyond the interval [s , sj], will still possess the 
required property. Of course, the numbers in the scale will 
change linearly as s o , sj the prospects chosen to have zero- 
utility and unit-utilityare changed; but the required property 
is preserved. 

The postulates tacitly used in this observation of utility- 
scaling can be roughly listed as follows: 

1. Numerical function u(s) exists, such that u(s^) u(s 2 ) if 
and only if Sj is preferred or equivalent to s 2 . 

2. If u(s 1 ) > u(s 2 ) then, for any 83 and any probability p, 
u(sj, S 3 ; p, 1 - p) u(s 2 , s 3 ; p, 1 - p). 

3. If u(sj) > u(s 2 ) > u(s3) then there exists a probability p 
such that u(s 2 ) = u(sj, 33; p, 1 - p). 

2.9. Unknown and subjective probabilities. A natural objec- 
tion against formulations like ours is to say that in reality the 
probability distribution F of external conditions is not known. 
In this case the problem becomes one of statistical inference, 
invoking principles like tf known a priori distribution on the 
space of F f s," Vf minimax," etc. For the case of a "one-person 
team" (when activities consist only of observations and actions, 
no messages being sent) one might refer to works like 
Dvoretsky-Kiefer-Wolfowitz 1 second article on Inventories [5]. 

The assumption of a known a priori probability distribution 
is particularly germane to our problem, whether in the simple 
case of three-phase-teams or in more complicated cases. In 
a three-phase-team, there is no opportunity to acquire or im- 
prove the knowledge of the distribution F in the course of the 
team's operations. In this case, and if F is not known from ap- 
propriate statistical manipulation of data collected in the past, 
we shall still say that a rational decision-maker, if he makes at 
all choices among actions, has to make consistent assumptions 
about the probabilities of outcomes of the actions just as he has 
a consistent scale of the utilities of those outcomes. A postu- 
lational basis for the simultaneous existence of subjective utili- 
ties and probabilities, extending in a plausible way the postulates 
of 2.8, was proposed by Frank Ramsey, and more recently by 
L. J. Savage. 5 
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If the operations of a team form a sequence of more than 
three phases, it is still useful to obtain the benefit of subjective 
probabilities. The rational decision-maker will start with some 
subjective probabilities. Even if they are mere hunches roughly 
summarizing his previous experience they will help him to lay 
down a plan as to how best to respond to any given sequence of 
future observations by an appropriate sequence of actions and 
communications. His actions, in this case, may but need not 
include data manipulation to provide progressively improved 
estimates of the distribution F of observables. 

2.10. Orders. In 2.2 all messages were defined as state- 
ments (reports) about the external world. Reports are different 
in verbal content (and therefore differ in cost) from "orders" or 
"task assignments." It seems, however, that by introducing or- 
ders no essential change will be introduced in our problem, at 
least at the present stage of analysis. 

For example, suppose the relevant features of the world are 
characterized by two variables, xj and ^. Let team member 1 
observe Xj and team member 2 observe ^ and let the re ~ 
spective observed values be x^ and X2. Consider first the 
following rules: R 1 consists of R^ only, viz.: 1 tells 2 the 
observed value xj; R^ consists of R^ only, viz.: 2 chooses 
a 2 = a 2 that wil1 minimize u(a2, XI Q , X2). In this case, mem- 
ber 1 is a mere "reporter." The payoff function u(a2, xj, X2) 
a function of three variables is fully known to member 2, but 
need not be known to member 1. 

Now suppose, instead, that it is the member 1 who knows the 
function u(a2, xj, x 2 ), and let the rule R 1 = R] 1 be as follows: 
1 shall communicate to 2, not the observed value xj , but the 
"task," i.e., the function of two variables 



Member 2 then proceeds to accomplish the task. That is, the 
rule R n = R2 n is: member 2 (who had observed X2 = X2) shall 
choose a 2 = &2 so as to maximize UQ^, X2). In this case, 
member 1 is the "boss": he determines, in the light of his 
knowledge of Xj = Xj , the special task that member 2, the 
"subordinate," has to fulfill i.e., the specialized payoff func- 
tion u o that 2 has to maximize. In military terminology, 1 is 
in charge of strategy, while 2 is in charge of tactics. The ob- 
servable xj may consist of information about the overall plans 
of the enemy (plus the information received by the commander 
during his training), while X2 may be a local situation. The 
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distinction between reports and orders will not be pursued fur- 
ther in this paper.** 

2.11. Non-additive costs. The definition of net gain in (2.6.3) 
is quite appropriate when the amount to be maximized is the ex- 
pected net money gain. In more general cases, one cannot rep- 
resent the maximand as the difference between (expected) gain 
and (expected) cost and will use some more general function to 
describe net gain. However, (2.6.3) will help us fix the ideas. 

2.12. A remark on the general multi-phase case. 7 The 
three-phase case can be generalized by constructing, for every 
phase t = l, ,T, the following matrix with n+1 rows and 
columns (regarding aj as a set of values): 

ajj means: "what i tells j" (a subset of Xj orclj) 

aj o means: "what i does to the outside world* 1 (an element 
of Si) 



means: "what i observes in the outside world" (an ele- 
ment of 



a ii> a oo are empty. 



Then a{ = i-th column = i f s recently acquired information, 
= sequence (a*, 24 ) = i f s state of information, 



a*? = i-th row = i f s messages and actions. 

In a stationary team there exists a set of rules R= (Rj, 
independent of time, such that 



The team's gross gain for the period ending at T depends on 
the sequence of external events, ZQ ; and on the sequence of 
actions taken, aj; but since the latter depend on aj and on 
the rule R, the gross gain is u(aj, R), say. The cost of com- 
munications will also depend on Sj , R. Hence the net gain is 
vfaj, R), say. The distribution of external states a depends, 
in general, on the sequence of previous states as well as on the 
sequence of previous actions ("controlled stochastic process"). 
Hence the distribution of the sequence Sj depends on the initial 
state a and on the rule R. Write this distribution as 
F(aJ; a, R). Then the ^expected net gain (integrated over the 
space of all sequences a<j ) is 
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v(^, R) dF(a J; ag, R) = V(R; a o ; v, F), 

say. The rule R=K that maximizes this expression will depend 
on aQ, v and F. As in the three-phase case, the problem is to 
find how certain classes of rules (certain networks and proce- 
dures) correspond to certain classes of tasks or, more gen- 
erally, to certain classes of net gain functions and to certain 
classes of distribution of external conditions. 

2.13. Procedures and Networks. In the problem defined so 
far, one seeks to determine what each member has to do or to 
communicate in response to observations he makes and to mes- 
sages he receives. Which are the members from whom he can 
receive and to whom he can send messages is a given not an un- 
known of this problem. The unknown of the problem is, in other 
words, the best procedure (the rule R), given the network of 
communication. The other givens are the cost c of sending a 
message; the gross payoff function u depending on certain ac- 
tion variables and observables; and the probability distribution 
F of the observables. 

In a more complete team problem (it may be called the prob- 
lem of the team's constitution), the network is not given. In ad- 
dition to the cost c of sending a message over an existing net- 
work (cost of using the network) there is the cost C = C(N) of 
constructing a network N. More precisely, if the team is a 
three-phase-team, C(N) is the cost of maintaining and amortiz- 
ing communication facilities per period of time needed to go 
through its three phases. The cost C depends on the nature of 
the network N involving, for example, one-way communications 
(letters), two-way communications (telephone), many-way com- 
munications (conferences) between certain members. C does 
not depend on whether certain communication lines are or are 
not being used, i.e., on whether a message is or is not actually 
sent. 

Given, then, the cost c of sending a message; given the func- 
tion C(N) determining for each network the cost of maintaining 
and operating it; and given the functions u and F, the constitu- 
tional problem seeks to find simultaneously the best pair (N,R), 
i.e., the network and the procedural rule such that the expected 
net payoff be a maximum. The procedural rule R consists, in 
three-phase teams, of the communication rule R 1 and the action 
rule R . The unknowns of the complete team problem are: N, 
R 1 and R n . 
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Note that the concept of network does not confine itself to the 
physical communication plant. The description of a given net- 
work (also called "organization chart") states all the permanent 
"positions" within the team, thus including all long-tern employ- 
ment contracts, all regularly occurring conferences (but not 
emergency conferences), periodical reports (but not ad hoc re- 
ports), etc. The number of team members is thus itself a char- 
acteristic of the team, and the optimal size of a team for a given 
task i.e., for a given function u(a, x) is a part of the constitu- 
tional problem. 

The distinction between network and procedure is analogous 
to the economists 1 distinction between long-run decisions (such 
as building a plant) and short- run decisions (such as buying a 
certain amount of fuel). Although important intermediate cases 
exist, the distinction is useful at the present stage of analysis. 

One way to study the complete team problem is to start with 
"pure network problems" and "pure procedural problems." In 
pure procedural problems, the network is given. Hence the 
function C(N) (costs of maintaining a network) is not involved, 
but the cost c of sending a message is. The unknowns of the 
pure procedural problem are (in three-phase teams): the com- 
munication rule R 1 and the action rule R 11 . 

In pure network problems, on the other hand, the network N 
is not given, and the cost c of sending a message is assumed 
negligible o Therefore, any communication line if it exists can 
be assumed to be used, so that R 1 is uniquely defined for each 
N. The unknowns of the pure network problem are (in three- 
phase teams): the action rule R n and the network N. 

Section 3 of the present paper can be said to deal with a pure 
network problem, although in a very rudimentary form. A some- 
what more detailed analysis of pure network problems and of 
complete team problems has been attempted elsewhere: see 
[12], [14]. Section 4 starts with a pure procedural and ends with 
a constitutional problem of a very simple nature. 



3. BUYING INFORMATION 

3.1. The value of an inquiry. Some important aspects of the 
team problem are brought out in the following simple case. A 
person's profit (utility) is u(a,x) where a is his action and x is 
the state of environment (an element of the set X). The person 
knows the distribution F(x). Assuming that he tries to maximize 
expected profit, how much should he be willing to pay for exact 
information on x ? 
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Compare the maximum expected profit U under two alterna- 
tives: when the person has not inquired, and when he has in- 
quired about the actual value of x. If he has not inquired, he 
will choose an action which is independent of x and which will 
maximize the expectation of the profit. This expectation will 
therefore be 

U = U = max Eu(a,x). 

L 

If, on the other hand, the person has inquired about the actual 
value of x he will be able to choose an action that will maximize 
the actual profit. The action chosen will depend on x. The ac- 
tual profit will be max u(a,x). The expectation of profit will be 

a 

U = Ui = E max u(a,x). 
a 

The difference Uj - U o can be called w, the value of inquiry: 

(3.1.1) w = E max u(a,x) - max Eu(a,x) >0, 

a a 

for Ui can never be smaller than U o .^ 

The statement that an inquiry can never have negative value 
( n no damage in knowledge I ") is easily confirmed as follows. 
Consider first the case when inquiry has been made. Then to 
each observed value of x will correspond a best value d of the 
action variable a, viz., the value of a that maximizes u(a,x) 
for the observed value of x. The relation between x and & we 
can write as & = (x). We call the function & the best rule of 
action. Since in applying this rule of action one obtains the 
highest value of u(a,x) consistent with the observed value of x, 
the result of applying this rule of action for every x is to ob- 
tain a higher expected value of u(a,x) than in applying any other 
rule of action. We can thus write 

Ui = max Eu(cr(x),x) = Eu(a(x),x), 

Of 

to indicate that the expected utility has been maximized with re- 
spect to the function a . Note that in the equations just written 
the symbol a does not appear; the utility u was expressed as 
depending on x and on the rule of action, a . 

Now consider the case when inquiry has not been made. This 
time the unknown function a (x) degenerates into a single un- 
known constant, since expected utility has now to be maximized 
in ignorance of x. We can write 
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U = max Eu(a(x),x) where a (x) = unknown constant. 

Thus Uj and U o are both obtained by maximizing the same 
quantity over a set of functions, but in the case of U o this set 
is restricted by the condition that a (x) be a constant. It is 
clear that if one is restricted in the choice of the rule of action 
one can never obtain a better result than if one is not so re- 
stricted. That is, Ui ^ U o . 

3.2. Example I: "guessing the color. n The player knows 
that an urn contains a proportion p ^ 1/2 of black balls, the 
other balls being red A ball is drawn from the urn. It is not 
shown to the player but may be shown to his agent. The player 
has to state the color of the ball. He wins a dollar if his state- 
ment is correct, loses a dollar if it is wrong. How much should 
he be willing to pay the agent for informing him about the actual 
drawing ? 

Solution: If the player would not employ an agent his best ac- 
tion would be to bet on black. This would yield him an expected 
gain U = pl + (1 - p)(-l) = 2p - 1 dollars. If, on the other hand, 
he employs an agent, his expected gain is Ui = 1. Hence the 
player should be willing to pay the agent any amount up to 
w = Uj - U o = 2 - 2p We call w "the value of inquiry" 

3.3. Example II: "a monopolist." A monopolist sets the 
price a of his product and produces just enough for the demand. 
At price a the demand is hx - ka where x is a random variable 
depending on the public's taste, and k is a positive coefficient. 
The mean p. and the variance o 2 of x are both known to the 
monopolist. For greater convenience, and without loss of gen- 
erality, the coefficients h and k can be made equal to 2 and 1, 
respectively, by appropriate choice of units. Then the profit is 
equal to a(2x - a) minus the cost of production. Assume this 
cost to be independent of the amount produced. How much can 
the monopolist pay a market research agency for keeping him 
informed of current public tastes, i.e., of the actual value of x? 
(We shall assume that, for the monopolist, utility and profit are 
identical, i.e., he maximizes expected profit.) 

Solution: Since cost is independent of his decision, the 
monopolist can disregard it. He will maximize the expectation 
of the quantity u = a(2x - a). If he is ignorant of the actual value 
of x he chooses the price a so as to maximize Eu = E a(2x - a) 
= a(2/z- a); the chosen price will be a = /i, resulting in an ex- 
pected profit (disregarding constant cost) U o = ji 2 . If, on the 
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other hand, the monopolist is informed of the actual value of x, 
he will choose every time the price a so as to maximize 
u = a(2x - a); i.e., his best rule of action will be: a = x. This 
yields an expected profit (disregarding cost) Uj = E x2 = (^2 + a 2) 
Hence the value of inquiry is w = Uj - U o = a 2. Thus the serv- 
ices of the market research agency are the more valuable the 
stronger the variability of tastes; (although, if the demand were 
not a linear function of price, the approximate measure of vari- 
ability would turn out to be not a 2 but some other parameter of 
the distribution of x). I shall not subscribe to a newspaper if 
the news does not vary much from day to day! 

3.4. Example III: "a speculator. " Suppose a speculator 
cannot buy or sell more than one unit of a commodity. Let x be 
the difference between tomorrow's and today's price. The specu- 
lator knows that x is distributed uniformly between JLL - p and 
JUL -- p. How much is it worth for him to foresee exactly the price 
change, x? 

Solution: Suppose first that he is ignorant of x. If the mean, 
JUL (known to him I) is positive his best action is to buy one unit; 
if IJL is negative, the best action is to sell one unit. The expected 
gain is U = | Ml- If, on the other hand, he is informed of the ac- 
tual value of x his best rule of action is: to buy one unit if x is 
positive, to sell one unit if x is negative. This yields him a 
gain |x|. To compute the expected gain Uj = E|x| distinguish 
two cases: 1) if |/i| ^ p, x has always the same sign and 
U x = E|x| = I /JL| = U ; 2) if \n\ < p, we have 

i /f M+P f \ 

U x = |x| = y-ll xdx+1 -xdxl 

v* o M~P 

- M 2 + P 2 
" 2p ' 

Therefore the value of inquiry is 

if |/i| A p 

(3.4) w = Uj - U = 

(|/i| -p) 2 /2p if|Ml L p. 

Thus, inquiry is the less valuable the stronger is the known bias 
| \L | of the distribution of the price difference x; and the smaller 
is the range 2p of its variation. (See Figure 1.) This is as 
would be expected on intuitive grounds. 



1 /(M+ 
~ 2p\ 2 



204 



DECISION PROCESSES 




25- 



Fig. 1. The ''speculator's case 11 . 

Value (w) of inquiry when x is distributed uniformly with 
mean \i and range 2p. 



3.4.1. It may be noted that the "speculator's case" as de- 
fined has a more general application than it might appear. In- 
stead of interpreting the extreme values of a as "buy or sell 
one unit" one might regard them as "do or don't." In fact, any 
two-valued set of decisions, so frequently occurring in practice, 
would fit the example, provided u is linear in a; and, only for 
the sake of simplifying the computations, x, or some transform 
of it, is assumed to be uniformly distributed. For example 
a = + 1 may mean "attack," a = -1 "don f t attack," and x = differ- 
ence between our forces and those of the enemy. In all these 
interpretations, since u = ax, positive x makes a = 1 advisable, 
and negative x makes a = -1 advisable. Clearly, with u linear 
in a, the choice of action that will maximize the expected value 
of ax is not affected if the pair (-1, +1) is replaced by (0, 1) or 
by any other pair of numbers of which the first is smaller than 
the second. 

3.5. Value of inquiry and "amount of information." Let us 
now summarize our examples using our general notation. 

In the "monopolist's case", u = a(2x - a) and F(x) has known 
mean M and variance a 2 . One obtains 

U = max E u = ;i 2 ; (best a = /i); 
a % 

Ui = E max u = (n 2 + a 2 ); (best rule of action: a = x); 
a 

w = a 2 . 
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In the "speculator's case" u = ax; -1 4 a 4 1; F(x) is uni- 
form, the lowest and highest values of x being, respectively, 
M - p and p. + p. The quantities U o , Uj, w were evaluated in 
3.4 above. 

The case of "guessing the color 11 is a variant of the specu- 
lator's case; u = ax; a and x have each only one pair of values 
(-1 and 1). 

Thus, the value of inquiry, w, depends on both the distribu- 
tion F(x) and the payoff function u(a,x). The amount of informa- 
tion as defined by Shannon [19] (or its negative, the "entropy" 
or "uncertainty" or "randomness") depends on F(x) only, as 
does the amount of information defined by R. A. Fisher [7]. As 
a result, the value of an inquiry, for a given payoff function, need 
not coincide with, or be related in any simple way to, the amount 
of uncertainty in Shannon's sense For example, in 3.2, the 
value of inquiry permitting now p to be smaller than 1/2 and 
writing q = 1-p is w = 1 - | p - q|, while Shannon's uncertainty 
in the case of binomial distribution is proportionate to (or to the 
logarithm of) pP q^. Note that both expressions have a maxi- 
mum when p = 1/2 (maximum uncertainty) and a minimum when 
p = 1 or (certainty). If x is uniformly distributed with mean 
fi and range 2p, Shannon's entropy depends on p only; while the 
"value of inquiry" in the case of such a distribution and assum- 
ing the payoff to depend linearly on a bounded action variable a, 
depends also on the mean, M, as in (3.4) above. Finally, in our 
example of 3.3, the value of inquiry is proportional to the vari- 
ance <TX, for any distribution, while entropy is related to vari- 
ance in a simple way in the case of normal distribution but not 
in general. 

The concepts of value of inquiry and of amount of information 
are thus different. They serve different purposes. 



4. ONE OBSERVER, ONE ACTOR: 
NETWORK^ PROCEDURE. CONSTITUTION 

4.1. The problem of "buying information" as a network 
problem. In 2.13, we distinguished between "pure network prob- 
lems," "pure procedural problems," and complete or "constitu- 
tional" problems. In Section 3 we have dealt with a simple 
(perhaps the simplest possible) pure network problem. The 
decision always to obtain information or never to obtain it was 
supposed to be taken once for all. Such decision defines a net- 
work. If the decision is never to receive information, the 
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network chosen is that of a team consisting of a single man, an 
"actor," who takes no observations, but always acts in a certain 
way, that is the best on the average: a "traditional" or "routine" 
system. If the decision is always to receive information, the 
network chosen can be interpreted in two ways: 

1) as a team consisting of a single man ("observer-actor") 
who is provided with more or less costly physical facilities for 
observations, or uses a non-negligible part of his time for ob- 
servations; 

2) as a two-man team an actor and an observer with more 
or less costly physical facilities for observation and communi- 
cation; in this case, the cost of using the communication line, 
once such a line is established (or once a fixed, non-negligible 
part of the team's time is set aside for communication), is neg- 
ligible, so that the observed value of x is always communicated 
by the observer to the actor, regardless of the observed value of 
x (e.g., regardless of whether there is or there is no "emer- 
gency"). 

4 2o A procedural problem. Let us, now, consider as given 
the following network: One member of the team (the "observer") 
makes observations on the external world. There is a one-way 
communication line between him and the other member (the 
"actor"). After having made the observation the observer may 
either communicate or not communicate it to the actor. In the 
former case, a cost, c dollars, is charged to the team, and the 
actor chooses his action on the basis of the observation com- 
municated to him. In the latter case no communication cost is 
charged, but the actor has to choose his action without knowledge 
of the actual state of the external world. Of course, the proba- 
bility distribution of the states of the world is assumed to be 
known in each case. 

The problem is: 1) to find the rule of communication for the 
observer, i.e., to determine conditions (observed states of the 
world) under which he shall or shall not report to the actor; 
and 2) to find the rule of action for the actor, i.e., to determine 
which actions he shall perform upon receiving certain communi- 
cations or upon receiving none. 

Denote as before by X the set of all states of the world x. 
Let X 9 x be the subset of X such that if the observed state 
belongs to X , the observer shall send no report. Thus, not to 
send a report is tantamount to telling the actor that x is in X . 
The rule of communication can be written as follows (with X 
yet to find): 
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R(x) = X if x e X (no report sent); 
(4.2.1) 

= {x} if xeX-X (report sent). 

The function R corresponds to R 1 of Section 2.6. Its value 
R(x) is the information made available to the actor. The rule 
of action can be written as follows: 

(Action a if R(x) = X, xe X 
(4.2.2) { 

'Action a(x) if R(x) = |x},x/X. 

The action a and the function a are yet to find. a is the ac- 
tion that maximizes the conditional expectation of gross payoff, 
given that x belongs to X : 



(4 2.3) max 

Thus a does not depend on x. It depends on the set X that the 
team chooses in advance of observation. We can write a = 
a(X). On the other hand, a(x) is the action that maximizes 
u(a,x) for given x: 

(4.2.4) max u(a,x) = u[a(x),x], xe X - X. 

a 

The function a together with the constant a correspond to R** 
of Section 2.6. Clearly, it is advantageous for the team to have 
the report withheld whenever the observed x is such that 
u[a(X),x] is larger than u[cr(x),x] - c. That is, the net expected 
gain is maximized if X, a and a satisfy the following condi- 
tion: 

(4.2.5) X = { x|ufa(X),x] > u[a(x),x] - c } . 

[This intuitive result can be easily obtained formally by express- 
ing the net expected gain through combining (4.2.3) and (4.2.4)]. 

4.3. Example: the procedural aspect of the "speculator's 
problem". As in Section 3.4, let u = ax, -1 4 a 4 1. Then u 
takes its maximum when a is either -1 or +1. a = 1 may mean 
M buying one unit" and a = -1 "selling one unit." The observable 
x may mean the predicted price change, and u the profit. 
Therefore a negative x makes a = -1 advisable; a positive x 
makes a = +1 advisable. [As remarked in 3.4.1 above, a = 1 
may also mean "attack," a = -1 may mean "don't attack," and x 
may mean "difference between our forces and those of the enemy. 11 ] 
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To simplify computations, x is assumed to have uniform proba- 
bility distribution. 

The pure network aspect of the team problem given by the 
described nature of a, x and u was discussed in 3.4 above. 
We shall now deal with the procedural aspect of the same prob- 
lem and, later (in 4.5) present the solution of the complete 
problem of n best constitution" when a, x and u are as de- 
scribed. 

4.4. In our example, the observer finds the actual value of x 
and may or may not communicate it (at cost c) to the actor who, 
in either case, has to decide whether to buy or sell one unit. 
What are the best rules for communication and action, given the 
(uniform) probability distribution of x? To simplify the problem 
we shall first assume the mean /i = 0, so that -p < x i p. 

The communication rule can belong to one of two classes: 

XV : call only if the desirable action = 1 (= "buy") 
p- : call only if the desirable action = -1 (= "sell"). 

Let us first look for the best rule in I 1 *. Then, if the actor is 
called, he will choose a = 1; if he is not called, he will choose 
a = -1. In the notation of Section 4.2, 

a = a(x) = 1, x/X 
a = a =-1, xe X. 

Since u = ax, condition (4.2.5) defines the best set X as 
follows: X (the set of observations that are not communicated) 
shall consist of all x such that 

-x > x - c, i.e., x < c/2. 

Hence the best communication rule belonging to the class I* + 
is: to call whenever x ^ c/2. The corresponding best action 
rule is: buy if called, sell if not called. 

We find by a similar reasoning that the best communication 
rule belonging to the class X 1 " is: to call whenever x ^ -c/2. 
The corresponding best action rule is: sell if called, buy if not 
called. 

Using now the known distribution of x, we have to compute 
the net expected gains (V+ and V", say) obtained under these 
two competing rules, and choose the rule yielding the higher net 
expected gain. In our case, we have 
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c/2 f p 

c/2 



- xdx + / 

Jc/ 







V = | | (-x-c)dx + / 

/.a 



-c/2 / p 

(-x - c)dx + / 
-p -/-2/2 



= 
Hence 




, 
V+ = V- = (p - |) 2 /2p , c ^ 2p 

=0 , c ^ 2p . 

Thus in our case both rules are equally good and can be summed 
up as follows: if the cost of communication exceeds or is equal 
to the range 2p of the observable, the observer shall never com- 
municate, and the actor shall take any action he pleases (since 
the expected gain = in any case); if the cost of communication 
is less than 2p, then either (1) the observer shall call whenever 
x > c/2, and the actor shall buy whenever called; or (2) the ob- 
server shall call whenever x < -c/2, and the actor shall sell 
whenever called. We shall call these rules (1) and (2). 

4,5. If we had not assumed uniform distribution of x to be 
symmetrical (midpoint JLL = 0) the two rules would not turn out 
to be equivalent. In this, more general, case the solution is as 
follows: If c/2 = p - | M 1 1 never call; buy (sell) if M > 0), 
always yielding the expected gain I /i | . If c/2 = (p - | JUL |), then 
apply rule (1), or rule (2) above depending on whether M is 
negative or positive (See Figure 2.) This agrees with common 
sense. If /i < 0, then a large positive x( 5* c/2) is less likely 
to occur than a numerically large negative x( ^ -c/2); and it is, 
of course, preferable to pay for signalling rarely rather than 
frequently. 

If, as suggested in 3.4.1, a = +1 means "do" (e.g., 
"attack") and a = -1 means "don't," then rule (1) becomes: 
"call to encourage doing whenever the situation is very favor- 
able relative to cost of communication (x > c/2)"; rule (2) be- 
comes: "call to warn against doing, whenever the situation is 
very unfavorable relative to the cost of communication (x < -c/2/ f . 
Roughly, if favorable situations are less likely to occur (JJL <0), 
apply rule (1); otherwise, apply rule (2). 
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Fig. 2. The "speculator's case 11 with one observer. 

Rules of action and communication ("call" region shaded) 
when x is distributed uniformly with range 2p and with 
mean /z, negative (above) or positive (below). 



If JLL -4 and the best appropriate rule is applied, a reason- 
ing similar [but more complicated: see [12], pp. 15-17] to that 
in 4.4 yields the following net expected gain: 



(4.5.1) V = 



(p - 



if p - | n 
otherwise. 



4,6. The f> constitutional M problem. Let the cost of having an 
observer (per time period needed for one decision) be C. We 
have now the choice between two networks: one without, the 
other with observer. In each network the best appropriate rules 
will be applied. We have thus to choose between two "constitu- 
tions": I) not to have an observer, and let the actor use the best 
action; II) to have an observer, and to let him apply the best 
rule of communication while letting the actor apply the best rule 
of action. 

Under Constitution I, the best action is: a = -1 if M < 0, 
a = 1 if |i > 0. This results in a net expected gain Eax = aEx = 



Vj = |u|. 
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Under Constitution II, the expected gain obtained under the 
best rules of communication and action was shown in (4.5.1), 
where the cost c of sending a message but not the cost C of 
maintaining an observer (per period of time needed for each 
decision) was taken into account. The expected gain, correcting 
for this latter cost, is therefore 



v n = -c + | ji | + (p - IM | - W/2p if ^ p - |M I ; 

AA t t 

= -C + I jJ. I otherwise. 

Therefore the difference Vjj - Vj is positive, and hence Consti- 
tution II is preferable to I if the parameters p, | JLL |, c and C 
obey the conditions 

(P- 1^1 -) 2 = 2Cp, p-jMl -f = 0; 

hence (taking the positive square root because of the second in- 
equality and dividing by p): 

~P~*2 

That is, if best rules of action and communication are used, the 
team shall have an observer only if, relative to the variability 
(measured by p) of external events x, 1) the costs c and C are 
small and 2) the expectation JUL of x is not too far from zero 
(i e., if favorable events are not much more, and not much less, 
likely to occur than unfavorable ones). (See Figure 3.) 



5. OPTIMAL INEXACT INFORMATION 

5.1. In Sections 3 and 4 the observer could send either an 
exact report or none at all. In Section 2, however, the problem 
was defined more generally. Information can be given with vary- 
ing degrees of precision. 

The organizer of the team can determine in advance a parti- 
tioning P of the set X of states of nature into disjoint subsets 
(possibly infinite in number) which we shall denote by 



The observer finds that x belongs to xi m ', say. He communi- 
cates to the actor the index m. This index will thus depend on x, 
the actual state of nature; and on p, the partitioning chosen in 
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Fig. 3. The "speculator's case." 



Region (triangular) where use of observer is advisable. 

123 
(With p = 1 and C = 0, 3, *, -K.) 



advance. The actor chooses then an action a that will maxi 
mize the conditional expectation 



E{u(a,x)|xeX p m )} - 



f u(a,x)dF(x) 
y(m) 



the denominator being the probability that x falls into the subset 
xt m ). Hence to the partition p and to the given functions u and 
F will correspond an expected payoff 
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U = U(D; u,F) = 2 max / u(a,x)dF(x) . 
*^ ma ^xl m ' 

The quantities U^ and U o defined in 3.1 were extreme cases. 
11=1^ if every X m ' consists of a single element of X. 
("exact information"). U = U o ("no information") if XJ, 1 ' =X; 

therefore U n = max / u(a,x)dF(x). We can thus define the 

a X ~ 

value of an inquiry corresponding to a particular partition Jj as 

w(p> = U(p) - U . 

The actor will be informed into which subset X^ m ' x falls, only 
if such communication costs less than w(p). 

5.2. Code. A given partition p determines a "code," ac- 
cording to which an observed state x will induce a signal 
m = m(x). The function m(x) may be called a "code." The code 
studied in Section 4 was a two-valued one: either "call" or "no 
call" depending on whether x falls into the one or the other of 
the two intervals into which X was partitioned. (Fig. 2.) A call 
as well as the absence of a call conveyed to the actor a certain 
information. We have seen that in order to implement the code 
two kinds of cost had to be borne: a constant cost C of main- 
taining the network and a (random) variable cost of using it. 
The variable cost had two values, c and zero, depending on 
which of the two signals ("call," "no call") was used. In a more 
general case, the variable cost may have more values: at most 

*' 



as many as there are subsets 

With the cost thus defined the "constitutional" problem of 
Section 4 can be extended as follows: how mapy observers shall 
there be in the team, and what codes shall they use? We shall 
not touch in this paper on a still more general extension that 
would introduce two or more actors. 

Two types of optimal partitions p of the set X present 
special interest: 1) an optimal system of intervals for each of a 
given number of observables; 2) an optimal set of observables. 

5.3. Optimal system of intervals. If x is a real number, and 
X an interval, X may be partitioned into sub-intervals. These 
may or may not be of equal length. If they are, their number n 
measures the precision of information. For example, if n = 10 k , 
k integer, variations of precision are expressed by the number 
k of decimal digits. We have then a sequence of partitions: 
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k = 0, 1, . On the lines of (3.1.1) the gross expected 
utility will increase, or at least not decrease, with increasing 
precision of information; while the costs of both maintaining 
and using the network will presumably also increase or at least 
not decrease. The problem is to find the optimal value of k. 

5.3.1. If x is a real vector of, say d dimensions, 

x = (xi, , x<i), the same principle applies: about each say 
the i-th of the d components, an inquiry of precision kj can 
be made. The expected gross payoff as well as the cost will de- 
pend on the vector (kj, , k^); and the vector maximizing ex- 
pected net payoff will be chosen. 

5.3.2. As pointed out by L. Tornqvist the optimal code will, 
in general, consist, for each real variable, of intervals of un- 
equal rather than equal length. (The unequal "brackets' 1 of pub- 
lished income statistics provide a well known example.) Thus 
the maximization of net expected payoff will be performed, not 
with respect to a precision measure k but rather with respect 
to the code function m(x), as defined in 5.2. 

5.4. Optimal set of observables. Another concept of pre- 
cision of an inquiry concerning a d-dimensional state of nature 
(x^, , xj) is determined by the components not necessarily 
real numbers about which information is sought, this informa- 
tion being exact. For example, if x^ represents future weather 
and X2 represents future prices, a farmer may wish to inquire 
about none or one or both of these, before making a decision on 
what to plant. Presumably both the gross expected payoff and 
the cost of maintaining and using the network will increase (or 
at least not decrease) if, to a given set of observables, an addi- 
tional observable is added. 

5.4.1. If some of the components (xj, , x^) are real 
numbers, the principles of 5.4 (choosing the variables about 
which information is sought) can be combined with those of 5.3 
(choosing the best system of intervals for each of these vari- 
ables). If all components are real numbers, 5.3.1 embraces 
both principles: ki = means that no information is given 
about the i-th component. 

5.5. An example of buying precision. Consider again the 
"monopolist's case" of 3.3: 

(5.5.1) u = a(2x- a). 
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We have seen that UQ = ji^, Uj = U o + a 2 . To simplify our il- 
lustration assume in addition that x is distributed uniformly 
over the interval = x ^ p. Then M = p/2, or 2 = p 2 /12, U o = p 2 /4, 
Ui = p 2 /3. Let us investigate the "buying of precision, 11 in the 
sense of 5.3, but without confining ourselves to the case when 
n, the number of sub-intervals, is 10^ (or 2^, etc.), where k = 
the number of decimal (or binary, etc.) digits used. The ob- 
server informs the actor about the interval into which x falls. 
That is, the observer gives the integer m, such that 

(5.5.2) l^n^n and H-llp^x^p. 

Thus m depends on n and x; m is the smallest integer not 

exceeded by . Upon learning ro. the actor chooses an action 

P 1 

in the midpoint of the interval: a - a(m,n) = p(m - ^/n, since, 

under our assumptions, this maximizes the conditional expecta- 
tion 

E{u(a,x)|m-l <nx/pim} 
which then takes the value 

U(m,n) = [a(m,n)] 2 = (m - \) 2 p 2 /n 2 . 

Hence, the (non-conditional) expectation of payoff that the actor 
has if he chooses a degree n of precision is 

U(n) = E U(m,n)/n = [n(n + lX2n + 1) . nfr^) + nl 
m=l n L J 



12n 

This agrees with the two extreme cases computed above: 
U(l) = U = p 2 /4 (case of "no inquiry"); 
U(oo) = Uj = p 2 /3 (case of "exact inquiry"). 
If the "value of precision" 

w(n) = U(n)-U(l) = p 2 (l ~) 



falls short of the cost of inquiry, inquiry will not be made with 
that precision. 
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We might develop our example further by assuming that both 
the cost and the precision depend on the size of a sample 

5 6. An extension to several observables. Let 



(5.6.1) u = a(2 



-f; y^-a), 



an extension of (5.5.1), with xi being distributed uniformly and 
independently over the interval < x 4 ^ PJ. Thus y j (a positive 
number) signifies the importance of and PJ the uncertainty about 
the variable xi. The actor chooses, for each i, the precision 
ni : i.e., he asks about the value of m^ that satisfies 



(5.6.2) -l<x i p i , 1-1, 




In particular, if ni = 1, no inquiry is made about x^ Thus we 
have the situation described in the last sentence of 5.4.1, By a 
reasoning extending that of 5.5, one can show that the precision 
ni for the variable xi becomes more valuable as its uncertainty 
Pi and/or its importance y\ increase. 

5.7. The role of statistical dependence. It can be further 
conjectured that, the larger the statistical dependence between 
xj and X2, the smaller is the advantage of learning about both 
over learning about only one of them. In the example that fol- 
lows, we drop the assumption of independent distribution of the 
x f s; and we consider only the case 5.4 where each variable 
can be learned either exactly or not at all. Let, as in (5.6.1) 
(but with y f s all equal) 

u = a(2xj + 2x 2 - a) , 

and assume xj to be two-valued, with the following probability 
distribution: 



(5.7.1) 



Pi! Pl2 



LP21 P22J 

If the determinant PU P22 - P21 Pi 2 is denoted by d, and the 
marginal probabilities by Pi = Pn + Pi2 = 1 - QI, P2 = Pll + P21 
= 1 - q 2 , then the above matrix of probabilities is equal to 
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" PI P 2 + d PI q 2 - d " 

(5.7.2) 

_P 2 QI - d QI 2 2 

Piqj measures the "uncertainty 11 about Xj; (it has its maximum 
for pi = 1/2, minimum for p 4 = 1 or 0). d 2 measures the sta- 
tistical dependence between xj and x 2 

Denote the maximum expected payoff by 
UQO if no inquiry is made 
Uu if inquiry is made on Xj and x 2 
if inquiry is made on xj only 
if inquiry is made on x 2 only. 

Then UQQ = max Eu = t E ( x l + X 2 )l 2 = (Pi + P2) 2 
a 

(5.7.3) 

Uu = E max u = E(XJ + x 2 r = UQQ + PI^I + P2^2 + 2d 
a 

To compute UIQ, write 

max al -a + 2E(xj + x 2 lxj = 1)1 = I E(XJ 4- x 2 |x^ = 1)1 = Uj ; 

max al-a + 2E(xj 4- x 2 |xj = 0)1 = [JB( x i + X 2 I X 1 = )J = u o> 
say. Then UJQ = u lPl + ^^l an( * therefore 
(5.7.4) U n = " d2 

We see from (5.7.3) that the value of inquiring about both vari- 
ables, compared with no inquiry at all, is the larger the larger 
the uncertainty p^ about each variable. We see from (5,7.4) 
that the advantage of inquiring about both variables, compared 
with inquiring about one only is the larger the larger those un- 
certainties and the smaller the statistical dependence (d 2 ) be- 
tween the two variables. I am not going to subscribe to a second 
newspaper if it mainly repeats what the first paper says, or if it 
always says the opposite I [As to the role of the algebraic value 
of d in (5.7.3), and, hence, of positive vs. negative correlation, 
this has probably to do with the concave nature of the assumed 
payoff function u with respect to a. But this is only a con- 
jecture.] 
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6, CONCLUDING REMARK 

The results obtained in our various examples seem to agree 
with common sense, but, I think, only with common sense re- 
vealed by hindsight, except possibly for those M exact minds who 
feel by a sort of instinct" who appeared in the quotation from 
Laplace in Footnote 1 of this papei*. I may remind the read- 
er that the problems solved here surely belong to the simplest 
team problems that one can imagine. As the networks and the 
payoff functions become more complex, as statistical dependence 
between observables is introduced, as non-additive costs are ad- 
mitted, as the simple three-phase models are abandoned in favor 
of models permitting feedbacks, etc., unarmed common sense 
will become more and more powerless, and precise logical 
analysis more and more necessary. Even though such analysis 
will always have to use simplified models while reality includes 
many "things that cannot be submitted to calculus, it gives the 
surest hints that can guide us in our judgment and teaches 
us to avoid illusions that ofttimes confuse us." 



FOOTNOTES 

1. Laplace adds: "it makes us appreciate with exactitude that 
which exact minds feel by a sort of instinct without being able 
ofttimes to give a reason for it. ... Even in the things that 
cannot be submitted to calculus, it gives the surest hints that 
can guide us in our judgment . . . and teaches us to avoid the 
illusions that ofttimes confuse us." 

2. Note: an ordering GI on Sis said to be complete if it has the 
following two properties: (a) if s, s f , s" are in S then s Gj s f 
and s f GI s" imply s GI s" (transitivity); (b) if s and s f are in 
S then s GI s 1 or s f Gi s (completeness). If (a) but not (b) is 
satisfied, the ordering Gi is said to be partial. 

3. See G. Debreu [4]. 

4. See [14], [10]. 

5. See [17], [10]. 

6. In an earlier paper a different definition of orders was given. 
An order was defined as a message restricting the actions of 
the subordinate to a certain subset of possible actions. The 
present definition resulted from discussions with Roy Radner 
and Herbert Simon 
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7. See L. Hurwicz [8]. 

8. This is easily shown formally for the case when x takes dis- 
crete values xi with probabilities pi(i = l, 2, ) By definition 

max u(a,Xf) = u(a,Xj) for all a,i. 
a 

Multiplying by Pi (all non-negative) and summing over i: 

2 Pi max u(a,Xj) > Spi ufoxj) for all a 
i a i 

E max u(a,x) E u(a,x) for all a. 

a 

The last relation holds also when a on the right side takes the 
value that maximizes Eu(a,x). This proves (3.1.1). Briefly, 
the average of maxima cannot be smaller than the maximum 
average. 

9. A suggestion by R. Radner. 
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No concept can be regarded as clear unless the form of 
statement in which that concept is used to assert a simple, 
singular proposition is completely determined. Since the form 
of a singular sentence should not depend on the particular in- 
dividual names appearing in it, it cannot be regarded as deter- 
mined unless it is possible, when the individual constants in a 
sample singular sentence are replaced by variables, to specify 
the class over which each variable is to range. In other words, 
a determination of the singular sentence form of a concept in- 
volves determination of the nature of the entities to which the 
concept is to be ascribed. 

This requirement may be left aside as long as a concept 
figures primarily in theoretical discussions. Equations can be 
studied formally as soon as the quantities involved are charac- 
terized with respect to their values; and order postulates can be 
laid down without inquiring into the nature of the entities being 
ordered. However, once the problem of empirical measurement 
is raised, the form of sentence in which the results of measure- 
ment are to be expressed must be determined. 

Although it may be obvious that both traditional and current 
treatments of the utility concept are not entirely satisfactory in 
this respect, I should like to review their limitations more ex- 
plicitly before going on to suggest a different approach. 

In classical analyses, utility is treated as a real-valued func- 
tion defined over points in n-space. The form of singular sen- 
tences for the concept might then appear to be something like: 
M The utility of x for entities 1, 2, . . . , n, in the amounts 
yj, y2, . . . , y n , is equal to u utiles. M In other words utility ap- 
pears here as a quantitative relation between an individual and n 
magnitudes, or, equivalently, as a non- quantitative relation 
("being the utility function of 11 ) between an individual and a func- 
tion of n magnitudes. Now this sort of formulation raises vari- 
ous questions. 

221 
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First, the magnitudes over which the quantity is defined are 
(usually) "timeless". The individual involved, however, is time- 
extended. But a magnitude cannot be unambiguously ascribed to 
a system extended in space or time without some further con- 
structions or conventions being made. Is static utility to be 
based upon an empirical assumption that a dynamic utility stays 
relatively constant during certain periods? Or is it to be a sta- 
tistical construction from such a dynamic quantity (e.g., a time- 
average)? If an underlying dynamic utility is assumed, is it to 
be taken as continuously characterizing the (n+l)-tuple (the in- 
dividual and the n quantities) during sleep, for example, or only 
at times of decision? And what about these quantities? Are they 
to be dated themselves, as stocks and flows in the typical analy- 
ses would be? And if so, are their "dates" to be identical with 
that on which the utility is ascribed to the individual? Until such 
a static concept is unambiguously related to time-dependent con- 
cepts we cannot be said to have a concept at all but only a partial 
set of specifications for one. 

Time indeterminacy is only one aspect of the more general 
question. Another aspect is seen in the fact that the number of 
dimensions, n, is left undetermined in economic discussions. 
Yet the dimensionality of the space over which any empirical 
magnitude is defined is a fixed formal property of that magni- 
tude. Indeed, the interpretation of the magnitude on each dimen- 
sion must be fixed and unchanging if the concept is to have a 
fixed, clear meaning. 

One reaction to this problem is to suppose that in principle 
utility is to be defined over a space of a very great number of 
dimensions, most of which are ignored as uncritical in any given 
problem. But is there any complete set of commodities or en- 
vironmental variables over which this ultimate utility can be 
conceived as being defined? It seems doubtful. In any case, 
until such a set is determined, the form of the singular sentence 
for a utility so conceived is indeterminate. Thus all statements 
concerning measurement must be regarded as ellipses for very 
elaborate assertions whose form is unknown. 

A further problem in connection with the conception of utility 
as defined over a (commodity) space, is that it makes difficult 
the intelligible measurement of money or any other single vari- 
able. The very conception of such a space is based on the sup- 
position that the utility at any point is not just the sum of sepa- 
rate utilities for separate commodities. 

Another reaction to the problem is put forward in what I shall 
call the portability doctrine. According to this view, utility should 
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not be regarded as defined independently of any problem or ex- 
perimental situation. Rather, the detailed interpretations and 
logical decisions concerning its form are to be made in the con- 
text of a given problem. The concept is to be treated as a sort 
of portable partial framework, carried from problem to problem, 
to be filled out in structure and exact significance according to 
the demands of the specific problem. This would render it unlike 
any concept in the physical sciences, although its supporters 
often compare the development of the utility concept to that of 
"temperature" and other physical magnitudes [4]. (Quantities 
such as temperature might be said to vary in meaning from 
axiom-system to axiom-system e.g., from statistical mechan- 
ics to thermodynamics or from one mode of measurement to 
another, but hardly from problem to problem.) In any case, 
measurements could be carried out only for this specifically 
interpreted IT utility M , not for utility in general. So the results 
would have no direct consequence for the general theory of 
utility. 

A third approach to the problem may be derived from the 
traditional "satisfaction" interpretation of utility. In the clas- 
sical analyses of consumer behavior, utility appears to be in- 
formally interpreted as a satisfaction caused by receiving the 
specified amounts of the commodities. The form of singular 
sentence for such a satisfaction concept might be: "The satis- 
faction caused in x at time t by receiving commodities 
1, 2, . . . , n, at the rates yj, y2, . . . , y n (measured at time t) 
equals s degrees. T V This has the same sort of difficulties as the 
previously considered forms. If s is just the amount of satis- 
faction due to these commodities, is total satisfaction due to 
other commodities besides? And is s to be added or otherwise 
combined with other satisfactions to arrive at a total satisfac- 
tion? These and other questions are obviated, however, if we take 
satisfaction as a state variable of the individual concerned in the 
same way that temperature is a state variable of a system or 
point within a system. No matter what temperature causally de- 
pends on in various circumstances, the form of its singular sen- 
tence is: "The temperature of system x (or point x) at time t is 
equal to T degrees." It is not: "The temperature of x at time t 
due to .... equals T degrees" (where the dots are replaced by 
any explicitly given list of physical magnitudes). If we choose to 
use a state variable, "pleasure "would be a better word than "satis- 
faction" since the latter term carries a suggested reference to 
external cause. The type-sentence for pleasure would then be: 
"The pleasure of x at time t equals p degrees." We would then 
have the empirical problem of determining what pleasure depends 
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on, rather than the pseudo- logical one of deciding what entities 
utility is to be ascribed to. However difficult such a task might 
appear, it would seem a more honest and healthy relocation of 
the problem. Science is better equipped to unravel intricate 
causal connections with the help of clear concepts than to inter- 
pret measurements of ill-defined quantities. 

So much for the traditional treatment. Turning now to the 
von Neumann- Morgans tern approach we see that their concept 
of utility is defined over the class of all probability combina- 
tions of events in a class of mutually exclusive events, conceived 
as capable of taking place in the immediate future of a given in- 
dividual x at a time t. Provisionally, this seems clear enough 
from the standpoint of singular sentence structure. Its form 
seems to be something like the following: M The utility of E for 
x at time t equals u utiles." This contains no dots nor indeter- 
minate number n. All we have to ascertain is the nature of the 
entities over which the variable E is to range. Are probability 
combinations of events also events ? Before looking into this 
question let us consider only the original set of alternative 
events. The word "event" is used in two distinct ways which it 
is essential to distinguish. For this purpose I shall make use of 
Rudolf Carnap's term "proposition" [l] for the sort of entity 
designated by a complete sentence. More suggestive synonyms 
might be "situation" or "state-of-affairs". That a certain coin, 
a, was tossed at a certain moment, tj, is a proposition, true or 
false. The class of tosses with that coin, on the other hand, is 
not a proposition. The class determines a frequency, but not a 
truth value. It may be regarded as a propositional function of 
time in the sense that it is designated by an expression ("coin a 
is tossed at t") which determines a proposition, true or false, 
for each specification of the time variable. Those probabilists 
who adhere to a relative frequency interpretation of probability 
use the term "event" in the sense of propositional function. Von 
Neumann and Morgenstern explicitly adopt the relative frequency 
view. Yet the events in the basic class are conceived as related 
to a "standardized moment. . . in the immediate future". This 
suggests that preference is basically conceived as holding be- 
tween propositions rather than propositional functions. This is 
borne out by the fact that the limitation is applied in order to 
avoid "the problems of preferences between events in different 
periods in the future". It is, of course, possible to rank proposi- 
tional functions at each moment according to preferences among 
the propositions obtained by specifying that moment. 
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However, this question becomes more serious when we con- 
sider what is involved in expressing a preference between two 
probability combinations of the members of the basic set. Sup- 
pose one combination consists of basic element A with proba- 
bility p and basic element B with probability 1 - p. If the 
probability involved is relative frequency probability, A and B 
can obviously not be propositions. Now suppose they are prepo- 
sitional functions. The probability of A must be the relative 
frequency of A with respect to some other propositional func- 
tion C (reference class). (For simplicity I ignore the problem 
of relative frequency limits in infinite classes.) Presumably, 
then, it is this event C that is being offered as an alternative 
to the events in the basic set rather than a probability combina- 
tion conceived in some more abstract sense. This seems borne 
out by the fact that in discussion of actual cases, where a person 
is presented with a probability combination involving event A he 
is informed that something like a roulette wheel will be spun if 
he chooses the combination and that A(tj) will be made true for 
a specified tj if the ball rests in a certain region of the wheel. 
For the subject to make a meaningful choice he must believe that 
the experimenter has the power and intention to fulfill his part of 
the bargain. That is, for a person to have a probability combina- 
tion pA + (1 - p)B as a genuine alternative to his basic set of 
mutually exclusive events (propositional functions) there must 
be possible for him the alternative of another event C exclusive 
from each of the others. For instance, C may be the event of 
accepting an offer made by a certain person, accompanied by a 
certain promise. Further, C must be capable of occurring at 
the standard future time; it must be such than an occurrence of 
C is always followed by occurrence of A or by occurrence of 
B, and finally, C must be such that the relative frequency of 
joint occurrences of A and C with respect to occurrences of C 
is p. This requirement, while possibly making clear the logical 
status of probability combinations of events as events themselves 
(propositional functions of time), seems to limit the application 
of the Neumann-Morgenstern approach to such matters as con- 
sumer tasteSo Alternatives arising in real life are seldom con- 
trollable by the outcome of a pre-set random device; this makes 
it misleading, if not meaningless, to consider choices between 
probability combinations of them. 

The reason von Neumann and Morgenstern give for adopting 
the relative frequency approach is that it yields a magnitude 
rather than an ordering. However this does not appear to be 
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exclusively a feature of the relative frequency approach since 
the work of Carnap [2], (also see [3]). 

Although Carnap 1 s work has not yet provided a very general 
theory of logical probability, the development of more general 
theories along the same lines seems inevitable, and it may be 
well to investigate its implications for the utility concept. 

First a few words about the concept of logical probability. 
Logical probability is a straightforward generalization of the 
basic concept of logical implication. If a sentence logically 
implies a sentence h, this means that in virtue alone of the 
formal meanings of "the sentences any possible world in which <e 
is true must be such that h also is true. In other words, the 
range of possible worlds in which e would be true is a subset of 
the range of possible world in which h would be true. The con- 
cept of "possible world 11 can be made precise for the purposes of 
the logic of a given axiom system in terms of certain strongest 
sentences (state-descriptions) formulable in the terms of the 
system. It seems then a natural and inevitable step to develop 
a theory covering the relation that holds between sentences 
whose ranges overlap, but without the simple containment cor- 
responding to logical implication. Here we would have something 
weaker than logical implication which we might call support or 
confirmation or logical probability. If a reasonable measure of 
range is found, this theory can become quantitative. Carnap has 
found such a measure for languages (axiom systems) involving a 
denumerable number of individual terms and a finite number of 
predicates of any degree. If m(h) is such a measure of range 
for the sentence Ih and jre is the conjunction of the sentences 

h and e, then we can take r-r as the logical probability of 

nue; 

the hypothesis h on the evidence . As might be expected this 
quantity takes the value 1 when logically implies h. This 
ratio bears an obvious resemblance to relative frequency but 
here there is no question of the "favorable event" h having a 
frequency. Being a sentence, it can only be true oFfalse. The 
multiplicity needed for arriving at a meaningful ratio comes 
from the ensemble of possible world states. I use the term "en- 
semble" here deliberately to suggest to those familiar with the 
use of this term in statistical mechanics that it may be logical 
probability rather than relative frequency probability which is 
used implicitly in statistical mechanics. 

Using the concept of logical probabiltiy, it seems possible to 
sketch out the form of a utility concept which takes propositions 
as its principal arguments rather than either indeterminate 
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numbers of commodities or events in the relative frequentist's 
sense, and which is potentially wider in its field of application 
than the latter. Since logical probability is in no sense a sub- 
jective probability, the concept first defined will be normative, 
a rational utility suitable for the definition of rational preference 
and rational decision. After this, the possibility of using a simi- 
lar schema for a non-normative empirical utility will be dis- 
cussed. 

Although it is not essential to this approach, I shall couch my 
discussion in hedonistic terms for three reasons. First, be- 
cause pleasure is a state variable having a clear logical form, 
it can serve as a model of form for alternative formulations. 
Secondly it seems of interest to see that the difficulties noted 
earlier in connection with the attempt to base utility on satisfac- 
tion can be avoided by use of probabilistic considerations. Third- 
ly, this procedure may suggest the possibility that some similar 
sort of psychological interpretation may offer ways of arriving at 
empirical utility estimates in addition to observing actual deci- 
sional behavior, though without making the choice-potential ap- 
proach more difficult or less valuable. 

Let P(x,t) be the pleasure or "hedonic tone" of x at time t. 
For any x this determines a time series which we may picture 
oscillating irregularly about a normal position from birth to 

death. Let 

ft+s 

H(x,t, S ) = P(x,T)dT. 



H(x,t,s) may be called the prospective happiness of x during s 
beginning at t. Of course, hedonist x at time t will not know 
the value of H(x,t,s) for any positive s, but he may be able to 
form estimates of it and he will hope that it is as large as pos- 
sible 

With logical probability we can form a concept corresponding 
to mathematical expectation of a magnitude z which we may call 
the estimate of z with respect to given evidence e. Let E(z,w,e) 
be the estimate of magnitude z for the argument w on the basis 
of the proposition e. (I shall use "e" as a variable over propo- 
sitions rather than over sentences, in contradistinction to Carnap. 
With the development of modal logic this becomes a perfectly 
feasible alteration conducive to simplicity of discussion without 
affecting the fundamental idea.) 

Let e(x,t) be a proposition representing the total experience 
x has accumulated by time t (the concept being defined makes 
the fiction that x has total recall). We may now lay down the 
following definition. 
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The proximate rational utility of proposition p for x at t 
with respect to time-span s equals u utiles if and only if p is 
logically compatible with the total experience of x at t and the 
estimate of his prospective happiness during s beginning at t 
on the basis of p together with his total experience at t equals u. 

Using the symbols already defined, together with the sign Q 
from modal logic for logical possibility (see p. 175, [l]) this can 
be written out: 



U(p,x,t,s) = u = dftP'etXptJJ-tE^^t^^-et^t)) = u) 
where the dot denotes conjunction of propositions. 

Absolute utility could be defined with respect to total pro- 
spective happiness: 

U(p,x,t) = df U(p,x,t,oo). 

Proximate utility seems a more useful concept, however, 
since absolute utility might vary too slightly between proposi- 
tions to be of help in practical decisions. 

The above formulation does not purport to be a definition of 
utility but rather a sketch of one possible form of such defini- 
tion, which would confer a clear logical structure on the utility 
concept. The essential point is the employment of propositions 
as the entities to which utility is attributed. This permits utility 
to be attributed to a situation consisting just of receipt of d 
dollars by xj at tj without raising questions of complementarity 
or the need to specify a total world future of which this was a 
part; for the definition requires that complementarities, possible 
world futures, and the like, in so far as x can determine them 
on the basis of e(x,t), are taken into account in the estimation. 

Propositions can be as simple or complex as we please and 
hence provide an extremely flexible base for theory or measure- 
ment (once they become well-defined through specification of the 
details of the language system employed). Indeed, the effects are 
about what could be hoped for from the portability doctrine with 
the difference that we would have here, at least potentially, a 
single, completely interpreted concept. Furthermore, instead 
of being restricted to sets of mutually exclusive acts or strate- 
gies we could attach a utility to any propositions not contra- 
dicted by our experience. We could then decide our preferences 
among them according to this utility regardless of whether they 
are within our control or whether they are mutually exclusive. 
For example, p and p-q are not mutually exclusive but it might 
be rational to prefer p to p-q if q is unpleasant, simply because 
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p does not entail p*q. It would seem essential to rational con- 
duct to be able to evaluate situations directly without regard to 
whether they are possible consequences of our actions. And it 
seems not inconsistent with normal experience to feel prefer- 
ences even among past events or among general laws as long as 
our own experience does not directly contradict them. On the 
other hand it should be noted that the suggested definition con- 
fers equal utility for x on all propositions known by x to be 
true. I am not sure how this corresponds to common feeling on 
the matter but it could not affect any actual decision behavior. 

Using symbols already introduced, rational preference (x 
rationally prefers proposition p to proposition q at time t) 
could thus be defined as follows: 

P(x,p,q,t) =df <>(P-e(x,t) ) Ofa'(M) ) (U(p,x,t) > U(q,x,t) ) 

In fact the measure of this preference could be taken to be the 
difference of the utilities. 

A definition of rational decision among courses of action could 
be based on the above definition if one could define some such 
concept as "p is a possible course of action for x at t" where 
"possible" is meant in the ordinary, not the purely logical, sense. 
It should be noted that Carnap f s formulation of Bernoulli's rule of 
rational decision, which rests upon an undefined concept of utility, 
has been made the basis for the present definition- schema for 
utility itself. 

The fact that rational utility is here sketched as defined di- 
rectly, with rational preference as a derivative concept, does 
not mean that measurements of empirical utility might not rest 
heavily on observed preferences. Nevertheless, I would be in- 
clined to define empirical utility in basically the same way. In- 
stead of using strict logical probability based on a Carnap 
measure function though, I would suppose that the individual's 
subjective probability could be represented by a measure func- 
tion varying in certain ways from the norm; and instead of taking 
his total experience, I would take that part of it which he actually 
brings to bear in making his choice. Using such a definition, in 
fact, one might be able to reason backward (under certain con- 
trolled conditions) from observed choices to partial determina- 
tion of a man's information, his proximate objectives, or the 
peculiarities of his subjective probability measure. On the other 
hand, one would not be limited to observation of overt choices. 
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CHAPTER XVI 

TENDENCIES TOWARD GROUP COMPARABILITY 
IN COMPETITIVE BARGAINING" 

by 
Paul J. Hoffman, Leon Festinger, Douglas H. Lawrence 

STANFORD UNIVERSITY, UNIVERSITY OF MINNESOTA 
STANFORD UNIVERSITY 

Introduction 

The present study is concerned with some of the socio- 
psychological factors that determine behavior in a situation 
where some ability is being revealed or measured. In such a 
situation, in order to evaluate their ability, persons frequently 
tend to compare their own performance with the performance of 
others whom they accept as comparable to themselves. As a 
consequence, the individual's behavior is determined more by 
his performance relative to those others than by the absolute 
level of his performance. If the situation in which the particular 
ability is measured is a bargaining situation where coalitions 
can form, it is possible to predict both the type and strength of 
the coalitions that form, and the relative stability of these when 
the composition of the group and the importance of the task for 
the participants is varied. 

A motivational analysis of this type has implications, not 
only for social psychological theory but also for the theory of 
games [ 2] . This is seen most clearly when "rational" bargain- 
ing behavior in the typical game situation is analyzed under the 
assumption that the only motivation present is a tendency on the 
part of each player to maximize the total number of points he 
obtains. Imagine a game involving three players where a fixed 
amount of money, points, or some commodity is to be divided 



*This study was conducted under contract N onr 225 (01) between 
the Office of Naval Research and Stanford University, and in co- 
operation with the Laboratory for Research in Social Relations 
of the University of Minnesota. The theoretical formulations on 
which the study is based were developed by Dr. Leon Festinger, 
and he will extend them in a forthcoming article. 
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among them. The rules of the game are such that no one indi- 
vidual can obtain the total amount by his own efforts. If any two 
of them agree to cooperate they can share the amount on any 
basis satisfactory to the two of them. 

Procedure. In addition the following conditions are assumed: 
(1) that all individuals are equally and singly motivated to ob- 
tain as large a share of the points as possible, and (2) that they 
are all equal in bargaining skill. Two things should be noted 
about this game. First of all, while it is mandatory for two of 
the three players to form a coalition in order to obtain points, 
a given player has no rational basis for selecting between the 
other two members in attempting to form the initial partner- 
ship. Hence, in the long run, each of the three possible coali- 
tions in this group will be formed equally often. The second 
point to note is that there is no logical termination to the bar- 
gaining process. Each player can break up any coalition formed 
in opposition to him by offering one player more points than he 
is getting from his present coalition. Hence an endless process 
of bargaining would develop and be maintained. 

This result, however, is contrary to everyday experience. It 
is to be expected that coalitions will form between particular 
pairs of players more frequently than between others and that 
these will be of a relatively stable and permanent nature. The 
difference between the "rational" prediction illustrated above 
and common psychological expectation centers about the nature 
and variety of the motivations operating in the bargaining situa- 
tion. 

When the sole motivation present is the desire to obtain a 
maximum number of points and when the players have equal 
bargaining skill, all coalitions between members are equivalent 
and equally likely to occur. Predictable and stable coalitions 
can form, however, when there is a variety of motives operating 
and their strengths are unequally distributed among the players. 
Then even though two players make identical offers to a third in 
terms of the absolute number of points involved, these offers 
are not equivalent in value for that player in terms of the other 
satisfactions they can provide. This results in the possibility 
that a particular individual may find himself in an advantageous 
position relative to the others. His offers may carry both the 
potentiality of points and the potentiality of other satisfactions. 
To the extent that a second individual requires both of these 
types of satisfaction, there is a high probability that he will 
accept offers from the first player. Moreover, the player left 
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out of such a coalition will be relatively impotent in his attempts 
to break it up as long as the desire for points is subordinate to 
the other motivations of the coalition members. From this 
point of view, it is obvious that coalitions may in fact become 
predictable and stable. 

The hypotheses underlying the present experiment are first, 
that these additional motivations in a bargaining situation arise 
in part from each individual's concern about his comparability 
to other members of the group on the ability that is shown in 
the bargaining. This results in pressures to achieve uniformity 
in the group. In our culture, at least, there are also motiva- 
tions to strive to be better than others which operate simul- 
taneously with the pressures toward uniformity. The contrast 
between these types of motivation and the motivation to obtain a 
maximum share of the points is seen most clearly in situations 
where the points involved are ratings, prestige objects or other 
symbolic representations of relative status. The individual's 
concern about them arises not because of their absolute or in- 
trinsic value, but because in this activity they are indicative of 
his status relative to others. In a competitive society, such 
concern about the status significance indicated by the symbols 
tends to operate even where money or other commodities of 
direct utilitarian value are involved. 

The influence of the concern about relative status on the* 
formation of coalitions can be shown by referring to the previ- 
ously described game. Assume that one of the individuals ob- 
tains an early initial advantage of a number of points while the 
others at this stage have few or none. These points function as 
symbols indicating to the others that this person is superior to 
the other two in that activity. The result is an increase in the 
motivation of the other two individuals of the group to draw 
close to him. Consequently they will be strongly motivated to 
form a coalition against him insofar as this aids in overcoming 
his lead. Furthermore, they will resist any attempt on his part 
to disrupt this coalition even though by accepting his offers 
either one of them might gain more points than he is able to ob- 
tain by remaining in the coalition. This happens because a con- 
flict has developed between their desire to acquire the maximum 
number of points possible, and their desire to reduce the dis- 
crepancy between themselves and others. The refusal of offers 
from a person with the large initial advantage in points indicates 
that the conflict has been resolved in favor of the desire to re- 
duce the discrepancy. 

This analysis is based on the hypothesis that a major 
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motivation of each individual is to compare himself with and to 
draw close to or surpass the others in the group. The reference 
individuals with whom each compares himself, however, are not 
selected indiscriminately. The amateur golfer does not com- 
pare himself with the professional, nor is the occasional bridge 
player concerned with his status relative to the expert. The 
conditions determining which individuals are selected as a ref- 
erence group are not clearly defined as yet, but in general an 
individual is concerned about his status relative to others whom 
he considers of approximately equal ability. He tends to exclude 
from his comparisons individuals who appear definitely superior 
or inferior to him in this activity. This is identical to the 
process of "rejection* when pressures toward uniformity arise 
concerning opinions. There the individual does not evaluate his 
opinions by comparison with others who are too divergent. 
Similarly, individuals do not evaluate their abilities by compari- 
son with those who are too divergent [1] . 

Conditions in which all members of a group regard them- 
selves as comparable are known as "peer" conditions. Condi- 
tions in which one or more individuals are regarded by the 
others as non-comparable, i.e., as definitely superior or in- 
ferior, are known as "non-peer" conditions. In terms of the 
previous analysis, it is expected that in peer conditions pre- 
dictable and stable coalitions will form in opposition to the indi- 
vidual who has an initial advantage. In non-peer conditions, 
however, this tendency will be reduced because the other mem- 
bers of the group are motivated to surpass each other and are 
less concerned with the status of the non-comparable individual. 
Even though the tendency to compete with the non-comparable 
individual is reduced, it does not disappear because the condi- 
tion of non-comparability is relative and continuous, rather than 
all or none. 

There is a second important determinant of decisions involv- 
ing the formation of coalitions. It is obvious that the desire to 
achieve and maintain relative status will vary depending upon 
whether or not the individual regards his status on that task as 
important. The golfer or bridge player will be concerned about 
status differentials which develop in the play of these games but 
not on a wide variety of others. Similarly, performance on an 
activity which requires the use of intellectual abilities will re- 
sult in a motivation to achieve status in direct proportion to the 
degree to which this type of activity is considered important by 
the individuals involved. Increasing the importance of a task in 
effect increases the individual's concern about any status 
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differences that may develop with the result that he becomes 
even more motivated to equal or surpass the others. Conse- 
quently, he becomes even less willing to cooperate with anyone 
having an initial advantage. Thus under conditions of high task 
importance the coalitions become highly stable. In contrast, 
conditions of low task importance may result in a situation in 
which incidental motivations are as strong as the concern about 
relative status. In this case the stability of the coalitions is 
reduced. 

In summary, it is postulated that an important motivation in 
a bargaining situation is the individual's concern about his sta- 
tus in the activity relative to other members of the group and 
his desire to equal or surpass them. The accumulation of 
points is significant for him because these represent the degree 
of comparability among the participating individuals. Conse- 
quently, predictable and stable coalitions in opposition to an in- 
dividual having an initial advantage will occur more frequently 
when that individual is a peer member of the group than when 
he is a non-peer member. Similarly, increases in the value 
placed on the points as a result of making the task important to 
the individual increases his concern over comparability and in- 
directly his tendency to form stable coalitions in opposition to 
an individual who obtains an initial advantage. These predictions 
are tested by the following experiment. 



SUBJECTS AND PROCEDURE 

Twenty-eight groups of subjects participated in the original 
experiment which was performed at Stanford University. The 
entire experiment was then replicated with twenty-eight groups 
at the City College of San Francisco. Subjects were under- 
graduates, drawn from the introductory psychology courses. 
The two groups were alike with respect to age, but the Stanford 
subjects were of a higher socio-economic level and probably of 
higher average intelligence because of the differences in en- 
trance requirements at the two schools. Stanford subjects also 
appeared to be somewhat more mature and less naive than the 
San Francisco students, most of whom had had no previous ex- 
perience as subjects in psychological experiments. Only male 
students were used. 

The twenty- eight groups in each replication were randomly 
but equally distributed between the four experimental conditions: 
(1) high task importance and peer relations, (2) high task 
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importance and non-peer relations, (3) low task importance and 
peer relations, and (4) low task importance and non-peer rela- 
tions. Of the three individuals in each group, two were sub- 
jects and the third was a paid-participant trained beforehand in 
the role he was to play. Two paid-participants were selected 
from Stanford for use in the Stanford Experiment, and three 
from San Francisco City College for participation with those 
groups. These were male seniors who were unknown to the 
subjects. When the two subjects and the paid-participant ar- 
rived for the experiment, a check was made to see that the two 
subjects had no more than a passing acquaintance with each 
other and that neither knew the paid-participant. In cases where 
this was not true, the group was discarded. The three group 
members were seated around a table with the positions clearly 
labeled as A, B, and C. The paid-participant invariably was 
seated at A. 

Each group was informed that the purpose of the experiment 
was to collect standardization and validation data on a new type 
intelligence test consisting of three parts. Subjects were told 
that the first part of the test was similar to the usual type of 
paper and pencil intelligence test, the second measured the 
ability to interact with others, and the third was designed to 
measure insight into one's own behavior and the behavior of 
others. In actuality the first was a paper and pencil intelligence 
test included to lend credibility to the situation, the second was 
the bargaining situation or test proper, and the third was a 
questionnaire to provide independent evidence that the experi- 
mental manipulations had changed the individuals 1 attitudes and 
perceptions. The low versus high task importance and the peer 
versus non-peer conditions were established in the following 
ways: 

1. Task importance. The importance of the task to the sub- 
jects waslnanlpulafedf in two ways: (1) by the instructions given 
them concerning the validity of the test purported to be measur- 
ing intelligence, and (2) by the content of the paper and pencil 
test. In the high importance groups, the instructions given them 
just prior to beginning the experiment were as follows: 

We have asked you to come here today to take a new 
intelligence test which has recently been devised and 
which has shown itself to be highly superior to the usual 
kind of intelligence test. We have scheduled three of you 
together because, although some parts of the test are 
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taken individually, other parts of the test require inter- 
action among three persons. The test is separated into 
a number of parts and we will explain each part to you 
when we come to it. 

Let me explain to you why we are asking you to take 
this test. Recent research in psychology has produced 
new knowledge about intelligence and intelligent behavior 
in people which has enabled psychologists to construct 
this new test. It has been tried out with many different 
kinds of people and in every case has been shown to be 
greatly superior and more valid in measuring the intelli- 
gence of individuals than the older type test. Needless to 
say, when such an important development occurs it is 
extremely valuable to accumulate as much data using the 
test as possible. As the test has not yet been used with 
people on the West Coast, we are especially interested in 
the data we will collect here. 

After we measure your I.Q. with this test we will com- 
pare it with other records we can get on you and with 
scores you have made on the older kinds of tests. There 
will be some of you for whom we do not have sufficient 
records. If that turns out to be the case we may have to 
ask you to take some other tests sometime within the 
next month. The results so far with this new test indicate 
that now, for perhaps the first time, we can really meas- 
ure how intelligent a person is with an extremely high de- 
gree of accuracy. We shall of course be glad to inform 
each of you about your I.Q. after we have scored the test. 

This emphasis on the validity of the test was underscored by 
the content of the items given the high importance groups during 
the pencil and paper part of the experiment. The printed booklet 
given these groups contained 24 synonym-antonym items and 20 
verbal analogy items drawn from the Terman Concept Mastery 
test, a section from the paragraph comprehension section of the 
Ohio State Psychological Examination, and 10 items from the 
Minnesota Paper Form Board. A ten minute time limit was im- 
posed on this test. 

The instructions for the low importance groups were designed 
to belittle the validity of the test. After the same introductory 
paragraph as used for the high importance group, the instruc- 
tions were as follows: 

Let me explain some things to you about why we are 
asking you to take this test. The psychologist who 
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published the test claimed that it was useful in measuring 
intelligence. Other people, trying the test out, have dis- 
puted this claim and have shown by their results that it 
has nothing to do with intelligence. In fact their results 
seem to show that it has nothing to do with anything. We 
have decided in the department here to do some very 
careful research to settle once and for all whether or not 
the test is any good. We have already given the test to 
large numbers of people, and, comparing their scores on 
this test with scores on other tests, with grades, and with 
many other measures, we are quite convinced that the 
present test which you will take is pretty meaningless. 
Nevertheless we want more data so that when we publish 
our results there will be absolutely no question about it. 

This lack of validity was underscored by the content of the 
printed booklet given the low importance groups during the 
pencil and paper part of the test. The improvised items were 
of the following types: (a) general information of an extremely 
low difficulty level, (b) items requiring value judgments on 
moralistic questions, (c) items requiring judgments of occupa- 
tion from facial expression, (d) items requiring judgments of 
emotion from facial expression, and (e) jokes to be rated in 
terms of their humor. 

2. Peer versus non-peer. The peer condition, in which each 
subject was to regard the other subject and the paid-participant 
as comparable to himself in intellectual ability, was established 
in part by instructions and in part by the behavior of the paid- 
participant. These instructions were given just after the com- 
pletion of the pencil and paper test and just prior to the bargain- 
ing situation. For the peer condition they were as follows: 

Before we start the next part of the test, I would like 
you to know some of the reasons for scheduling you par- 
ticular three persons together in the same group. This 
next part of the test requires that in each of our groups 
the three persons should be approximately equal in intel- 
ligence and mental ability insofar as this can be roughly 
determined in advance. We consequently have taken the 
liberty of looking up your grades, your various achieve- 
ment and aptitude test results, and as much else as we 
could get about you. We are reasonably certain that you 
three are very close together in intelligence as measured 
by those tests. 
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In this next part of the test you will see why it is nec- 
essary to have the three of you matched so closely in in- 
telligence. The next part requires that each of you deal 
with the other members in the group and consequently it 
is necessary that all three of you be as equal in intelli- 
gence as we could manage. 

The paid-participant emphasized his equality with the others 
by pacing himself during the pencil and paper test at the same 
rate as the two subjects in the group. 

In the non-peer groups, where the two subjects were to re- 
gard the paid- participant as definitely superior to themselves 
in intellectual ability and therefore as non-comparable, the in- 
structions were as follows: 

Before we actually start taking the second part of the 
test, I would like you to know some of the reasons for 
scheduling you particular three persons together in the 
same group. We wanted to be sure that in each of our 
groups that take this test, there was at least one person 
of very superior intelligence. Now one of you here has 
taken an intensive battery of tests earlier in the quarter, 
and we asked specifically that he sign up for this hour. 
The person in this group who took this intensive battery 
of tests earlier is the one of extremely superior intelli- 
gence. 

In this part of the test, part "B", you will see the rea- 
son we were so careful to be sure that there was at least 
one person in the group of extremely high intelligence. 
The next part of the test involves dealing with others in 
the group and in the way the test is standardized, it is 
necessary that such discrepancies in intelligence among 
you exist. 

As neither of the two subjects in the group had taken an in- 
tensive battery of tests, it was assumed that each would conclude 
that either the other subject or the paid-participant must be the 
one of extremely superior intelligence. This speculation was 
then directed at the paid-participant as the result of his subse- 
quent behavior. During the paper and pencil test, he worked 
through the booklet easily and quickly, turning it in well before 
the expiration of the time limit. 

Following these instructions, the next part of the test was the 
bargaining situation. Each of the three members of a group was 
given a set of triangular pieces cut out of masonite, the sets 
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differing from each other only in color. By assembling these 
pieces correctly it was possible for each member of the group 
to form an individual square requiring six of the seven pieces 
provided him. The seventh piece was a large right-angled 
isosceles triangle. Any combination of two players could form 
a "group" square by combining their large triangles. This 
group square had a larger area than did the individual square. 

The bargaining was governed by a set of rules read to the 
subjects in advance of the trials. These rules emphasized that 
the objective of the players was to earn points, since these 
points were to be added to the scores on the paper and pencil 
test to determine the I.Q. There were to be a series of five 
trials, each of four minute duration, and on each of which it was 
possible to earn as many as eight points. These could be earned 
in one of two ways. If a person assembled his individual square 
and no other squares were formed on that trial, the square was 
worth eight points. If two persons combined pieces to form a 
group square, this square was also worth eight points, provided 
that the two partners agreed on how they would divide these 
eight points between themselves. It was permissible for either 
of the two persons in an agreement of this sort to break it at 
any time during the trial and to enter into a new agreement with 
the third person in the group. In the event that more than one 
square was formed in a given trial, only the largest square 
would win. In case two squares of the same size were formed, 
no one would get any points. There was one exception to these 
rules. If any person succeeded in forming the individual square 
on the first trial, he automatically won that trial and in addition 
received a bonus of twelve points which he could divide among 
the three players in any way he wished. 

These rules achieved the following results. On the first trial 
while the situation was still somewhat unstructured for the sub- 
jects, each of them attempted to form an individual square be- 
cause of the bonus of twelve points offered. The problem was 
so difficult, however, that only the paid-participant was able to 
do it. He always decided to keep the points for himself, thereby 
obtaining an initial lead of 20 points over the other two mem- 
bers. Because of his manipulation of the pieces, it was always 
clear to the other two subjects by the end of the first trial how 
they should go about constructing their individual squares. But 
since the assembly of an individual square by either of the re- 
maining group members would result in a tie, with no points 
awarded, this solution became functionally useless for the re- 
maining four trials. The only possibility remaining was for two 
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players to form a group square from the large isosceles tri- 
angles. The paid-participant emphasized this point at the be- 
ginning of the second trial by first forming such a square with 
individual B and then with C as though he were just exploring 
the possibilities in the situation. As this group square was ob- 
viously larger than the individual square, it would always win 
the points when formed. 

The bargaining behavior of the paid-participant during the 
second through the fifth trial was predetermined. At the be- 
ginning of the trial he offered to make the group square with B 
and to give him four of the eight points. From this point on his 
behavior was governed by the following rules: 

1. If B said "yes", A rested until something else happened. 
If B said "no" A waited for a moment until some agree- 
ment between B and C had been reached. 

2. If B and C did not reach an agreement, A offered B five 
points. If the offer was refused, he offered six. If this 
was refused, he offered seven. After any acceptance, he 
rested until something else occurred. If the offer of seven 
was refused, he began directing offers to C in the same 
sequence. 

3. If the person left out made A an offer, A accepted it if it 
gave him more points. He did not take the initiative as 
long as he was in a coalition. 

4. If the BC coalition formed on an even split, A proceeded 
as in 2. 

5. If the BC coalition formed on an uneven split, A made an 
offer of four to that player who was getting the least. If 
the answer was "yes", he rested. If "no", he proceeded 
as in 2 continuing to direct his offers to the same person 
until an offer was accepted or until an offer of seven 
points was refused. In this latter case he began directing 
his offers to the other coalition member. 

The purpose of these rules was in part to make the bargain- 
ing behavior of the paid-participant appear natural, but at the 
same time to ensure that when forced he would always offer up 
to seven points. In this way the strength of the coalitions 
against him could be measured. As each offer was made, it 
was recorded sequentially by the experimenter in such a man- 
ner as to indicate the size of the offer, by whom it was made, 
toward whom it was directed, and whether it was accepted, 
ignored, or rejected. In the Stanford experiment, the subjects 
were given a warning 30 seconds before the end of each trial. 
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This was eliminated in the San Francisco replication since its 
effect was to materially reduce the amount of bidding within the 
period prior to the warning. 



RESULTS 

The results of this experiment are described under three 
headings. First the data from the questionnaire are analyzed 
to determine the extent to which differences in task importance 
and in peer relations were actually established between the 
various groups. Following this, the results bearing on the 
choice behavior during the last four trials of bargaining are 
presented under two headings. The first of these covers termi- 
nal coalitions, i.e., the agreements which existed at the end of 
a trial and thereby determined the distribution of points. The 
second presents the results pertaining to temporary coalitions, 
i.e., those tentative agreements existing prior to the formation 
of the terminal coalition. 

The Success of the Experimental Manipulations 

One item of the questionnaire given as the final part of the 
test had each subject rate his own I.Q. and that of the paid- 
participant on the assumption that discrepancies in these ratings 
would be indicative of the extent to which a given subject re- 
garded A, the paid-participant, as comparable to himself. The 
mean discrepancies in I.Q. ratings (rating of paid-participant 
minus self -rating) are evaluated in the analysis of Table 1. 
The subjects' rating of A in the peer conditions averages 3.27 
I.Q. points more than the subject's rating of himself, but in the 
non-peer condition it averages 13.00 points more. This differ- 
ence is significant (P < .001). Discrepancies are not signifi- 
cantly different for comparisons involving task importance, 
schools, or interactions. The assertion can therefore be made 
with a high degree of certainty that perceived differences in 
comparability were in fact established between the peer and 
the non-peer conditions, such differences being based upon 
assessment of intellectual status. 

A second item in the questionnaire had the subject evaluate 
the bargaining situation as to the degree of validity they believed 
it to have as a measure of intelligence. It is possible to infer 
from these ratings the degree to which subjects considered 
their performance in the bargaining situation as important, and 
thus the extent to which they were thereby motivated. The 
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Table 1 

Analysis of Discrepancies Between Subject's Rating 
of Paid-participant and Rating of Self on I.Q. 



Source 


d.f. 


Variance Est. 


P 


Importance 


1 


11.16 




Peer vs. Non-Peer 


1 


5,304.02 


<.001 


Schools 


1 


75.45 




Interactions 


4 


211.66 




Error 


48 


92.70 





obtained ratings on this item are evaluated in the analysis of 
Table 2. Subjects assigned to conditions of high task importance 
tend to rate the bargaining procedure as a more valid measure 
of intelligence than do subjects assigned to conditions of low 
importance (P < .001). In addition the difference attributable 
to the replications of the experiment is significant (P < .01), 
with the San Francisco subjects rating the bargaining situation 
as more valid than do subjects of the Stanford experiment. 
Since the San Francisco subjects represented a somewhat less 
select group in terms of college aptitude, it is reasonable to 
expect that they would be more concerned over their intellectual 
status and would consequently ascribe a higher importance to 
the task. 

Table 2 

Rated Validity of Bargaining Situation 
as a Measure of Intelligence 

Source d.f. Variance Est. P 



Importance 


1 


46.29 


<.001 


Peer vs. Non-Peer 


1 


0.57 




Schools 


1 


7.00 


<.01 


Interactions 


4 


0.14 




Error 


104 


0.76 





The Formation of Coalitions 

Assuming that the groups were differentiated with respect to 
the importance of the task and the degree of comparability be- 
tween members of the group, the major question is the influence 
of these variables on the formation of coalitions. A relatively 
direct measure of the extent to which coalitions were formed in 
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opposition to the paid-participant is the discrepancy between 
the number of points he was able to obtain on each trial and the 
number he would be expected to obtain if the coalitions were 
formed on the basis of chance. On a chance basis the paid- 
participant, hereafter referred to as A, would be expected to 
receive on the average a third of the eight points available for 
division, or 2.67 points per trial, as would each of the two sub- 
jects, B and C. In the event that A receives significantly less 
than the average of the other members of the group, he is being 
discriminated against by them as far as their willingness to 
form coalitions with him is concerned. 

The average number of points per trial earned by A in the 
last four trials under the various conditions of the experiment 
is shown in Table 3, along with the statistical analysis of the 
sums on which the averages are based. 

Table 3a 
Average Points Per Trial Earned by A 

Impor. School Peer Non-Peer Avg. 



High 


S.F. 


1.29 


1.75 


1.52 




Stan. 


1.57 


2.39 


1.98 




(Avg.) 


(1.43) 


(2.07) 


(1.75) 


Low 


S.F. 


1.32 


2.54 


1.93 




Stan. 


2.50 


4.36 


3.43 




(Avg.) 


(1.91) 


(3.45) 


(2.88) 


(S. F. Avg.) 




1.30 


2.15 


1.72 


(Stan. Avg.) 




2.04 


3.37 


2.71 


(Avg.) 




(1.67) 


(2.76) 


(2.21) 



Table 3b 
Analysis of Average Points Per Trial for A 

Source d.f. Variance P 



Importance 


1 


200.65 


<.02 


Peer vs. Non-Peer 


1 


274.57 


<.01 


Schools 


1 


208.28 


<.01 


Interactions 


4 


39.25 




Error 


48 


24.89 
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It is apparent from Table 3 that A is generally unable to ob- 
tain a chance number of points under the various conditions. 
The one exception is the condition of low importance and non- 
peer relations at Stanford, in which A receives significantly 
more points than would be expected. Under each of the two rep- 
lications of the experiment the pattern of results is essentially 
the same. A obtains more points under low importance than 
when task importance is high (P < .02), and more under the non- 
peer conditions than under peer conditions (P < .01). These dif- 
ferences are in the direction predicted by the theoretical assump- 
tions underlying the experiment. It should be noted that in each 
experimental condition A receives fewer points in the San Fran- 
cisco replication than at Stanford (P < .01). This consistent dis- 
crepancy becomes more meaningful when it is recalled that the 
questionnaire data gave evidence of higher importance being 
ascribed to the task in the San Francisco groups than at Stan- 
ford. The experiment might therefore be interpreted as includ- 
ing three different levels of task importance instead of two. 
This interpretation would reconcile the differences between the 
two replications. 

The failure of A to secure the number of points expected by 
chance is due to two factors: (1) he was unable to form a fair 
share (2/3) of terminal coalitions even though he was willing to 
offer as many as seven points, and (2) even in those coalitions 
of which he was a member he was unable to obtain a fair divi- 
sion (4 out of 8) of the points. The influence of the first factor 
is demonstrated in Table 4 where the average number of term- 
inal coalitions including A during the four trials is tabulated. 
The pattern of results is the same as that involving point totals 
for A. 

Table 4a 

Average Percent of Terminal Coalitions Having A as a Member 
Impor. School Peer Non-Peer Avg. 



High 


S.F. 


36 


46 


41 




Stan. 


57 


64 


60 




(Avg.) 


(46) 


(55) 


(50) 


Low 


S.F. 


43 


64 


54 




Stan. 


61 


86 


74 




(Avg.) 


(52) 


(75) 


(64) 


(S. F. Avg.) 




40 


55 


48 


(Stan. Avg.) 




59 


75 


67 


(Avg.) 




(50) 


(65) 


(58) 
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Table 4b 

Analysis of Average Number of Coalitions 
Having A as a Member 

Source d.f. Variance 



Importance 


1 


4.57 


<.05 


Peer Cond. 


1 


5.78 


<.05 


Schools 


1 


5.78 


<.05 


Interactions 


4 


0.18 




Error 


48 


1.01 





For all conditions combined, A is able to form fewer terminal 
coalitions than would be expected on the basis of chance (P < .01), 
and therefore fewer than the average of the two subjects. Dif- 
ferences for each of the experimental conditions are significant 
at the five per cent level and are in the expected direction. 

The influence of the second factor is shown in Table 5 where 
the average number of points per coalition obtained by A when 
A is in a coalition is tabulated. This analysis shows whether or 
not A is able to obtain a fair share of the points when he is one 
of the partners in a coalition. It is apparent from the table that 
the differences between means are once more in the anticipated 
directions. 

Table 5 
Average Points Per Coalition Earned by A 

Motiv. School Peer Non-peer Avg. 



High 


S. F. 


3.60* 


3.58** 


3.59 




Stan. 


3.19 


3.77 


3.48 




Avg. 


(3.36) 


(3.68) 


(3.53) 


Low 


S. F. 


3.12* 


4.00 


3.63 




Stan. 


4.15** 


5.16 


4.69 




Avg. 


(3.68) 


(4.57) 


(4.18) 


S. F. 


Avg. 


3.36 


3.81 


3.61 


Stan. 


Avg. 


3.63 


4.46 


4.06 






(3.51) 


(4.15) 


(3.86) 



*mean based on 5 groups. 
**mean based on 6 groups. 

Those groups in which A never succeeded in entering a coa- 
lition had to be omitted from the analysis. 
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The differences between the peer and non-peer conditions and 
between the high and low importance conditions are only signifi- 
cant at the 10% level of confidence. * 

The results of these three analyses are consistent in indi- 
cating that coalitions tend to form in opposition to the player 
who obtains an initial advantage. The strength of this tendency 
is greatest when the task is of high importance to the members 
of the group and when they perceive each other as peers or 
equals. As a result of this tendency, the paid participant is un- 
able to form his fair share of terminal coalitions and must pay 
more than a fair share of the points in order to form such 
coalitions. 

One additional analysis of the terminal coalition data was 
made to check on the validity of a deduction which was made 
from the theoretical formulation of this experiment. We would 
expect that in the peer situation B and C would be competing 
primarily with A and not with each other. We should then find 
that terminal coalitions between B and C took the form of an 
even division of the points. Conversely, under non-peer condi- 
tions, B and C would be competing primarily with each other, 
since A would be regarded as non-comparable by both of them. 
As a result, terminal coalitions involving B and C would tend to 
take the form of an uneven division of the points. An analysis 
of the types of terminal coalitions involving B and C supports 
this deduction. Under peer conditions 63% of such coalitions 
involved an equal split, but under non-peer conditions only 34 % 
of them did so, a difference significant beyond the .05 level. 
Thus, comparability of A to the other members in the group, 
induced by the peer conditions, makes a coalition involving an 
even split a desirable and stable outcome for the two subjects. 
Conversely, when A is incomparable, the competition between 
B and C makes the desirable outcome, for either one, an agree- 
ment which gives him more than the opposing subject. 

Another indication of the opposition to A is shown by an ana- 
lysis of the temporary coalitions, that is, those coalitions tenta- 
tively agreed to during the bargaining process. One measure of 
these is the number of excess points A must pay in order to 
break up an existing coalition between B and C. A discrepancy 
score was computed for all temporary coalitions in which A was 
not a member. This score is the difference between what the 
coalition subject was receiving in the coalition and what he ac- 
cepted from A in breaking up the coalition. If the subject re- 
fused all offers from A then the discrepancy was calculated as 
if the subject had accepted an offer of 9 points, one more than 
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the total number available. The average of these for each ex- 
perimental condition is shown in Table 6. 

Table 6a 
Average Discrepancy Paid to Break B-C Coalition 



Impor. 


School 


Peer 


Non-Peer 


Total 


High 


S. F. 


2.2 


2.0 


2.1 




Stan. 


3.4 


2.2 


2.8 




(Avg.) 


(2.8) 


(2.1) 


(2.5) 


Low 


S. F. 


2.4 


1.8 


2.1 




Stan. 


1.6 


1.2 


1.4 




(Avg.) 


(2.0) 


(1.5) 


(1.8) 


(S. F. Avg.) 




2.3 


1.9 


2.1 


(Stan. Avg.) 




2.5 


1.7 


2.1 


(Avg.) 




(2.4) 


(1.8) 


(2.1) 



Table 6b 
Analysis of Discrepancy Paid to Break B-C Coalition 

Source d.f. Variance Est. P 



Impor. 


1 


841.1 


<.05 


Peer Cond. 


1 


841.1 


<.05 


Schools 


1 


1.4 




Impor. Schools 


1 


970.9 


<.05 


Inter. 


3 


14.1 




Error 


48 


164.1 





The results follow the expected pattern. A must offer more 
points when the task is of high importance than when it is of low 
importance, and must offer more under peer conditions than 
under non-peer. The significant interaction between Importance 
and Schools is due to the fact that in the San Francisco data 
there are no differences between the High and Low Importance 
conditions while in the Stanford data these differences are large. 



DISCUSSION 

The theory which forms the basis for this experiment hy- 
pothesizes that competition arises because individuals, in 
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situations where they are evaluating some ability, are strongly 
motivated by a concern about their comparability to other mem- 
bers of the group with respect to the ability which they are 
evaluating. This concern over comparability leads to attempts 
to assess the abilities of others in relation to themselves. But 
an individual is not concerned over the comparability of all in- 
dividuals. Rather, he tends to exclude those who are perceived 
as definitely superior or inferior to himself in this activity and 
to concentrate on those who are perceived as being within the 
same general range of ability. 

To the extent that concern over comparability is present, dis- 
crepancies in points or other symbols come to have relative 
rather than absolute value. They tend to be interpreted pri- 
marily as indicators of the individual's status with respect to 
the other members rather than as something of direct utilitarian 
value. This is especially true when the points gained represent 
intellectual, athletic, or social ability, but it is also probably 
true to a large extent when they represent money or commodi- 
ties. As a result, whenever the task is made more important to 
an individual, the value of these points increases. This, in turn, 
adds to his concern over comparability to others and also to his 
motivation to surpass them on the task. 

It follows from these assumptions that in a bargaining situa- 
tion where all group members regard one another as comparable, 
stable coalitions will form in opposition to any member of the 
group who gains an advantage. The point advantage held by this 
member of the group is interpreted by the other group members 
as a loss in status to them, and they are consequently motivated 
to overcome it. Coalitions between them satisfy the motivations 
of both to retain comparability with each other while at the same 
time reducing the discrepancy between them and the individual 
with the initial advantage. The individual with the advantage in 
points cannot offer the same type of satisfaction. He will be 
compelled to offer excessive points commensurate with the 
score differential which exists in order to form a coalition. On 
the other hand, if this advantage is held by an individual regarded 
by the other two as non-comparable, coalitions in opposition to 
him will have less tendency to form, for his point advantage 
represents a smaller loss of status. Non-peer conditions would 
therefore be expected to be more favorable for a person with an 
initial advantage than would peer conditions. 

It similarly would be predicted that the formation of stable 
coalitions will be even more prominent as the importance of the 
task is increased. This follows from the assumption that any 
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increase in the importance of the symbols involved strengthens 
the motivation to achieve comparability in the group. Thus, 
conditions of high task importance will be less favorable to the 
person with an initial advantage than will conditions of low im- 
portance. 

The results of the experiment strongly support this general 
formulation of the motivations involved in bargaining behavior. 
This is shown clearly in the evidence that a group member re- 
ceiving an initial advantage in points is generally discriminated 
against throughout the remaining trials. It is reflected in the 
inability of the paid- participant to enter into the expected num- 
ber of terminal coalitions, in the relatively high price he is re- 
quired to pay in order to enter such coalitions, and in discrimi- 
natory bargaining in opposition to him as shown in the within- 
trials analyses. 

The evidence indicates that the strength of this opposition is 
a direct function of task importance and of the degree of com- 
parability between the group members. Such an interpretation 
is strengthened by evidence from the questionnaire that the 
conditions of the experiment were successfully manipulated. 

Differences between the two conditions of importance in the 
experiment are reliable and consistent in the analyses involving 
terminal coalitions, and in the intra-trial analysis of the excess 
of points required to break up a coalition between the two sub- 
jects. In each of these cases the bargaining was shown to be 
more favorable to the paid-participant under conditions of low 
importance than when importance was high. 

The effects of the peer and non-peer conditions are similar- 
ly in substantial accord with the predictions. Under peer con- 
ditions the paid-participant is less able to obtain points, less 
successful in entering into terminal coalitions, and he is re- 
quired to pay a higher price in order to do so than under non- 
peer conditions. This influence of peer relations is shown also 
by the excess of points which A is forced to pay in order to 
break up an existing coalition. 

It appears then that a large initial advantage in points re- 
sults in an intensification of competition against the paid- 
participant when that individual is regarded as comparable in 
ability. If the individual involved is regarded as non-compara- 
ble, the competition persists among the remaining group mem- 
bers, but the discrepant individual gains additional advantage by 
becoming the medium by which changes in status can be ac- 
complished between the others. Support for this latter state- 
ment comes from two sources: (1) examination of the relative 
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frequencies of occurrence of even and uneven point distributions 
in terminal coalitions involving B and C show that these tend to 
be formed with equal division of the points under peer condi- 
tions, but that under non-peer conditions such coalitions involve 
mainly inequitable distributions, and (2) an examination of the 
low importance, non-peer condition indicates that the paid- 
participant was able to obtain a significantly larger number of 
points than the average of the two subjects. (Table 3) 

Insofar as the findings in this experiment have generality, 
they have implications for two fields of inquiry, that of game 
theory and that of motivational theory, especially as each ap- 
plies to social situations. Game theory specifies that the 
choices between alternative strategies or courses of action 
should be chosen in such a way as to maximize utility. 

The present study suggests that the nature of the utility func- 
tion for individuals is not necessarily invariant, but is subject 
to modification from the effects of situational variables which 
may differ greatly from one context to the next. Consequently, 
these results indicate that motivational factors such as those 
suggested in the present experiment need to be included in 
formulations relating utility to external or behavioral reference 
points. 

The findings are more directly relevant to theories of social 
motivation and perception, especially as these pertain to be- 
havior in groups. The suggestion is that an important determi- 
nant of behavior in group situations where all members are en- 
gaged in a common activity is the concern of each member about 
his status relative to others on that activity. This is especially 
true in activities where there is no clear cut criterion available 
for individuals evaluating the adequacy of their performance. 
Consequently when a discrepancy between their own standing 
and that of others is perceived, individuals are motivated to re- 
duce that discrepancy. This motivation will manifest itself in a 
variety of ways, one of which is to form coalitions in opposition 
to any other member having a higher status on this task. 

Assuming that this formulation has generality, it gives rise 
to two important theoretical problems. The results of this ex- 
periment have shown that the strength of the motivational fac- 
tors involved depends in part upon the importance of the task to 
the individuals involved and in part on the degree of compara- 
bility between the group members. Consequently, it becomes 
necessary to formulate the conditions that determine whether 
or not a given task will be accepted as important by a given in- 
dividual, and the factors controlling his acceptance or rejection 
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of other individuals as a standard against which to evaluate his 
own performance. These factors undoubtedly include cultural 
and social variables as well as those unique to the past history 
of the given individual. 



SUMMARY 

The present experiment on competitive bargaining behavior 
in a group situation utilized 56 groups, each composed of two 
subjects and a paid-participant. The experimental variables 
were: (1) the importance of the task for the individuals involved, 
and (2) the degree of comparability between group members 
(peer versus non-peer conditions). Subjects were assigned to 
one of four experimental conditions as follows: (1) high task 
importance, peer relations, (2) high task importance, non-peer 
relations, (3) low task importance, peer relations, and (4) low 
task importance, non-peer relations. 

The experiment was designed so that the three group mem- 
bers were competing among themselves for points, but the for- 
mation of a coalition between two of them was necessary in 
order for points to be earned. The procedures used ensured 
that the paid-participant always obtained a large initial advan- 
tage. The rules governing the formation of coalitions specified 
that any agreements could be broken by either member of the 
coalition if he desired to enter a coalition with the third mem- 
ber. This made possible a continuous sequence of bargaining 
between the three members until the conclusion of the trial. 

The results of the experiment were as follows: 

1. The group member receiving a large initial advantage in 
points received significantly fewer opportunities to form coali- 
tions than did the other group members, and was required to 
pay a relatively higher price in order to do so. 

2. The reduction in opportunity to form coalitions and the 
commensurate increase in price demanded of the person re- 
ceiving a large initial advantage in points were more pronounced 
under conditions of high task importance than under conditions 
of low importance. 

3. The reduction in opportunity to form coalitions and the 
corresponding increase in price demanded of the person re- 
ceiving a large initial advantage in points were more evident 
under peer conditions than under non-peer conditions. 



TENDENCIES IN COMPETITIVE BARGAINING 253 

4. The results summarized in the three preceding paragraphs 
were reflected not only in the formation of terminal coalitions 
and the distribution of points therein, but also in the pattern of 
bargaining which occurred within trials. 



FOOTNOTE 

1. An analysis of co- variance on average points per trial earned 
by A adjusted for differences in number of coalitions of which A 
was a member yields the same conclusions. 



BIBLIOGRAPHY 

1. Festinger, L., Informal Social Communication. Psychological 
Review, 1950, ^7, 5, 271-282. 

2. Von Neumann, J. and Morgenstern, O., Theory of Games and 
Economic Behavior. Princeton: Princeton University Press, 
T944. (2nd Edition, 1947). 



CHAPTER XVII 

ON DECISION-MAKING UNDER UNCERTAINTY* 

by 

C. H. Coombs and David Beardslee 
UNIVERSITY OF MICHIGAN 

I. INTRODUCTION 

This paper presents a model which is a framework for a 
great variety of experiments in the area of decision-making 
under uncertainty. It is intended that this model will relate and 
organize experiments that have already been carried out and 
also suggest many more. The behavior with which the model is 
concerned is the preferential decisions of an individual among a 
set of "offers". Each of the offers consists of a "prize" (what 
he stands to win), a "stake" (what he stands to lose) and a prob- 
ability of winning. 

Willingness to gamble, or utility for risk, will be regarded 
as a suppressed variable primarily relevant for explaining dif- 
ferences between individuals 1 decisions not explained by differ- 
ences in their utilities or probabilities. The three variables, 
prize, stake, and probability of winning, are each regarded as 
psychological variables, not necessarily linearly or even mono- 
tonically related to, their "real" properties, e.g., "dollar" values 
or "objective" probabilities. The three variables, then, with 
which the model is concerned, are utility for prize (U(P) ), utility 
for stake (U(S) ), and psychological probability (^). 

The problem of the measurement of utilities and psychological 
probabilities is currently of great concern to psychologists [4], 
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[5], [9], mathematicians [8], [ll], philosophers [l], and econo- 
mists [10], [12]. Measurement of such variables on an interval 
scale requires the existence of a common unit of measurement 
and a common arbitrary origin, and further requires estimates 
of these. While models have been constructed for such purposes, 
as yet they have not been adequately tested and sustained experi- 
mentally. The method [3] of the experiments reported here does 
not require such assumptions, as it leads to an ordered metric 
scale [2] in which the stimuli are ordered and order relations 
on the distances between some pairs of stimuli are known. This 
method permits constructing such scales for a particular individ- 
ual. This is a desirable feature, as it may be anticipated that 
utilities and psychological probabilities are peculiar to individ- 
uals. 

In the section that follows, the component variables of the 
theory will be briefly discussed; then there will be a section 
classifying the different kinds of decision-making under uncer- 
tainty, giving, in effect, the different "dimensions' 1 of the prob- 
lem. A descriptive theory of decision- making under uncertainty 
is then presented in detail and followed by some experimental 
results. 



II. THE COMPONENTS 

A. Utility. Throughout this paper the term "utility" will be 
regarded as synonymous with "preferability." Degree of utility 
will be taken to be degree of preferability. If an individual says 
he prefers A to B, for whatever reason, this will be interpreted 
as meaning he has a higher utility for A than for B at that mo- 
ment. If there is inconsistency of his judgment under replication 
the majority judgment may be taken as indicative of relative 
utility. If an individual says that A and B are "harder to choose 
between" (as stakes or prizes) than are B and C, this will be 
said to imply that the difference in utility (as stakes or prizes, 
respectively) of A and B (the distance between them on a utility 
scale) is less than that between B and C. Formally, 

(1) A > p B if and only if U(P A ) > U(Pg) 

(2) A > s B if and only if U(S A ) < U(S B ) 

(3) (AB) (CD) if and only if AB < CD 

where: 
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^ means the judgment "pref erred to as prize"; 
jg means the judgment "preferred to as stake"; 

U(P A ) is the psychological utility of object A as a prize; 

U(S A ) is the psychological utility of object A as a stake; 

(AB) denotes the pair of objects A and B; 

'g" means the judgment "harder to choose between, as 
stakes, than"; 

AB is the distance between the objects A and B; 
means "less than" as stakes. 

Of course, an entire universe of problems arises, all of in- 
terest to the psychologist and economist, as to how the individual 
arrives at his utility for an object: what gives rise to his value 
system. These aspects of the utility variable are not considered 
here. This paper is concerned with measuring the utilities as 
they exist at any time in an individual, and with the role they play 
in decision-making under uncertainty. 

Utility for prize and utility for stake constitute a discrimina- 
tion on our part which may not be formally necessary in the 
model but jis necessary, or at least highly desirable, for design- 
ing or relating experiments. "Prizes" are the objects which an 
individual stands to win or gain. "Stakes" are the objects which 
the individual stands to lose or give up. Utility scales of either 
stakes or prizes are conceived of as running from negative 
utilities through zero to positive utilities. The prize for winning 
may be an object or consequence which the individual does not 
want and which would have negative utility as a prize. Corre- 
spondingly, the stake an individual puts up or invests may be an 
object which he wants to get rid of and, in consequence, this ob- 
ject may have a negative utility as stake. 

B. Probability. The following table is constructed to clarify 
the terminology used in the discussion of probability. The bot- 
tom row of the table, psychological probability, is regarded as 
an intervening variable between mathematical probability and 
decisions. This variable is involved, at least implicitly, in a 
large portion of human behavior. 



258 



DECISION PROCESSES 



Observation 



Replicable 



Single Event 



+/ 

a 

i 



a 



frequency interpreta- 
tion of probability; 
objective probability. 



mathematizable on basis 
of relative weights of 
contributing factors which 
are empirically deter- 
mined. 



bfi 



W 



personal probability: 
these may be percep- 
tions of relative fre- 
quency with objective 
frequencies implicit, 
or psychological values 
for explicit objective 
probabilities. 



subjective probability or 
degree of belief. 



The two kinds of personal probabilities may be clarified with 
examples. A personal probability which is based on implicit ob- 
jective frequencies would be an individual's feeling of the likeli- 
hood of filling a flush with a one card draw, derived from his 
past experience and not based on mathematical calculations nor 
on an explicit tabulation of such events in the past. Correspond- 
ingly, a personal probability based on explicit objective fre- 
quencies would be his psychological value for the known objective 
frequencies, whether obtained mathematically or from a tabula- 
tion of a number of such events. Whether personal probabilities 
are linearly related to relative objective frequencies or whether, 
for example, extreme relative objective frequencies are inflated 
or deflated in psychological value, are questions of experimental 
fact. They can be settled only by measurement of the psycho- 
logical magnitudes of the relative frequencies independently of 
their objective magnitudes. 

Examples which might clarify the meaning of subjective 
probabilities are: an individual 1 s feeling of the likelihood that 
a particular candidate will win an election, or that a state of 
war may be declared between two countries in a given interval 
of time. 

Again, a universe of problems arises as to the relation of an 
individual's psychological probabilities to the real world mathe- 
matical probabilities, and the axioms or laws that govern psy- 
chological probabilities. These problems range from questions 
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concerning their additivity to questions concerning the effect of 
information, attitudes, and beliefs on the psychological proba- 
bilities. As in the case of utility, the theory constructed in this 
paper is not concerned with the source of psychological proba- 
bilities and factors affecting them but rather with measuring 
them as they exist at a given time for an individual and the role 
they play in decision-making under uncertainty. 

C. Utility for Risk. While liking to gamble or willingness to 
take a risk may also be a parameter of decision- making under 
uncertainty it is a suppressed variable in this theory. The model 
conceived here can only be constructed for each individual sepa- 
rately. This is because there are no assumptions which permit 
comparing utilities or psychological probabilities between indi- 
viduals. Utility for risk would play an important role in com- 
parisons between individuals in dec is ion- making under uncer- 
tainty. 

D. The Measurement of Utility and Psychological Probability. 
The theory and experiments discussed in this paper are con- 
cerned with the nature of the interrelated roles that psychologi- 
cal probability and utility for stakes and prizes play in the 
decision- making of an individual. Clearly, utilities for objects 
and psychological probabilities may be expected to be personal 
to an individual and are not assumed here to be given as any 
prescribed function of their real world characteristics. Hence, 
their measurement must be derived from the judgments of an 
individual and the measures obtained are unique to him. 

The paradigm of these experiments involves the independent 
measurement of utilities and psychological probabilities and then 
independent observations on their "product space/' The method 
used to observe and derive measures of the utilities and the psy- 
chological probabilities is an adaptation of the Unfolding Tech- 
nique to the study of stimulus scales. This methodology is de- 
scribed in some detail elsewhere [3] so only certain character- 
istics of the measures will be discussed here. 

The methodology begins with the hypotheses that a certain set 
of objects (prizes, stakes, statements of probability or events, 
etc.) is an ordered set and that, in addition, pairs of objects are 
elements of a partially ordered set. The latter set gives a "dis- 
tance function." If the judgments of the individual satisfy certain 
conditions, the above hypotheses are sustained and the scale is 
recovered as an ordered metric [2]. Such a scale has the objects 
simply ordered and, in addition, some of the "distances" between 
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pairs of objects are known to be greater or less than other dis- 
tances. Such a scale lacks the convenience and power of meas- 
urement with numbers, as for example on an interval scale. 
However, it also lacks some of the assumptions required for 
measurement with an interval scale and also is vulnerable. By 
vulnerable is meant that the procedure for analyzing data does 
not necessarily yield a unidimensional scale, and thus it per- 
mits a test of the existence of such a scale. Hence, if uni- 
dimensional scales of utility or psychological probability are 
obtained, they are not consequences of the model alone. 



III. CLASSES OF DECISION- MAKING UNDER UNCERTAINTY 

The general area of decision-making under uncertainty in- 
cludes a wide variety of kinds of "environmental" conditions. It 
will be desirable to recognize explicitly some of these "dimen- 
sions" on which decision-making under uncertainty may vary. 
The following dichotomous classes are an initial attempt to 
identify these dimensions and the theory presented in this paper 
will be characterized in these different respects. 

A. Wager vs. Pay- to- Play. In a wager the individual's stake 
is forfeited in the event that he fails to win. In the case of pay- 
to-play the individual pays a fee which he loses regardless of 
the outcome of the play. A bet on a football game with another 
person is a wager, whereas buying into a pool on a game is pay- 
to-play. Buying a lottery ticket essentially characterizes pay- 
to-play. 

The theory presented here is explicitly for wager and not for 
pay-to-play. A possible modification of the theory to embrace 
pay-to-play would be to hypothesize that the utility of the price 
to play reduces the utility of the prize by a corresponding 
amount and the individual is playing a wager with "price to play" 
as stake and a prize worth that much less. 

B. Play Once vs. Repetitive Play. In the first case the game 
is only going to be played once, whereas in the second the game 
is going to be played a number of times. Conceivably, an indi- 
vidual's choice between playing a silver dollar slot machine or 
a nickel slot machine might be conditioned by whether the game 
is to be played only once or repetitively. An hypothesis might 
be that an individual would be willing to take a greater risk in 
"play once" than in "repetitive play." 
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In repetitive play there are a number of subclasses of en- 
vironmental situations: the successive offers may be the same 
or different, the length of the series may be known or unknown 
to the subject, the length of the series may be determined by 
the subject or the experimenter and before or in the course of 
the play. There are, in addition to these subclasses of repeti- 
tive play, subclasses pertaining to feedback, which may be im- 
mediate or delayed. Knowledge of success or failure may be 
immediate or delayed, and if immediate there may be either 
immediate or delayed payoff. Knowledge of success or failure 
is information which may affect psychological probabilities; 
payoff changes assets which may affect preferences between 
offers (i.e., the locus of the indifference curves). 

The experiments reported here are all under the play once 
rule. 

C. Method of Choice vs. Method of Single Stimuli. In the 
Method of Single Stimuli an offer is available which an individual 
may accept or reject. In the Method of Choice two or more al- 
ternative offers are available and the individual expresses a 
preference. The information contained in the data is different 
under these two conditions. In the Method of Single Stimuli the 
information is an absolute judgment as to whether the offer is 
on one side or another of an indifference surface in the product 
space. This surface thus partitions the space into a "play" and 
"no play" region. In the Method of Choice the information is a 
relative judgment as to which of the offers is "better", in some 
sense, to the individual. 

The Method of Choice may be further characterized as to 
whether the choice is compulsory or voluntary, ff the choice 
is voluntary no preference will be expressed when all offers 
lie in the "no play" region; hence, there is some absolute in- 
formation in such judgments, as well as relative. If the choice 
is compulsory a preference is always expressed and the entire 
product space may be studied, but there is no information in the 
data from which "play" and "no play" regions may be con- 
structed. 

The theory presented here is constructed explicitly for 
Method of Choice and the experiments all involve compulsory 
preference judgments. 
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IV, A DESCRIPTIVE THEORY 

A. A General Psychological Hypothesis-The Maximization 
of Expected Utility, The continua described permit a simple 
descriptive theory of decision-making under uncertainty. It is 
assumed that the psychological continua underlying these vari- 
ables have ratio scale properties but in the experimental pro- 
cedure used these scales are only recovered up to an ordered 
metric. The basic hypothesis is that in deciding between un- 
certain outcomes or events the individual chooses that offer or 
alternative which maximizes his expected utility. Note that this 
theory deals with maximization of "expected utility, 1 ' not with 
"utility of objective expected value." "Expected utility" is de- 
termined by some combination of U(P), U(S), and 0, and is to 
be distinguished from utility of expected outcome, U [objective 
probability x objective prize], 

This hypothesis is precisely analogous to the assumption 
frequently made in economics, save that the quantity maximized 
here is one involving U(P), U(S) and 0, all of which are psy- 
chological magnitudes. To decide which one offer involves a 
higher 0, a larger U(P), and less U(S) than any other offer, 
the simplest hypothesis is that the individual will prefer the 
offer that is maximal in some sense. When the individual must 
choose between two offers, one of which involves a larger prize 
than the other but with a lower of obtaining it, he must in 
some way decide "how much" utility to give up for a given in- 
crease in 0, or how much larger stake he will give to obtain a 
chance on a larger prize, and so on. 

This theory is offered as an hypothesis about the "rates of 
exchange" in such decisions. The hypothesis is that the indi- 
vidual always chooses that one of the available alternatives for 
which E is largest, where: 

(4) E = 

E = expected utility; 

= Psychological probability of winning the 
prize, tf> w < 1; 

U(P) = utility of the prize; 

~ psychological probability of losing the stake, 



U(S) = utility of the stake; 
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the exponents x, y, z, w are parameters defining 
for the individual the "relative weight" or "relative 
importance" of i/tyy, U(P), ^ L , and U(S), and are 
all > 0. 

It is to be remembered that E, U(P), and U(S) are all psy- 
chological magnitudes, rather than stimulus parameters. 

Equation (4) describes a general theory with more parame- 
ters than desirable. In the discussion and experiments which 

follow, only the case in which #L = 1 - ^w wil1 be treated > al " 
though the general theory as stated in equation (4) allows for 
situations in which the probability of losing the stake is not 
equal to 1 - W . A further restriction in the discussion and 
experiments which follow is the assumption that x = y = z = w = l 
unless otherwise noted. One illustration of a case in which x 
might not be equal to 1 might be "wishful thinking," that is, an 
overestimation of the ^ of winning. "Utility for risk" may be 
interpreted through these parameters. Such considerations will 
not, however, be pursued in any detail here, and, unless spe- 
cifically noted, the discussion which follows will concern only 
the theory modified by the simplifying assumptions just men- 
tioned. 

Although the working hypothesis that utility of a given object 
as stake is the negative of the utility of that same object as a 
negative prize appears here, this assumption is not necessary 
to the validity of the general theory. It is not assumed, on the 
other hand, that the utility of an object as positive prize is the 
negative of the utility for winning a debt of that object as a neg- 
ative prize. Symbolically, the working hypothesis is 

(5) If U(P A ) U(S A )< 0, then |U(P A )| = |U(S A )| 
(and it is not assumed that |U(P A )| = |U(S A )| when 

U(PA) U ( S A> > -) 

This theory as expressed in equation (4) involves the follow- 
ing assumptions of a psychological nature. First, the scales of 
utility and psychological probability are assumed to exist and 
satisfy the axioms for the real numbers, but these scales will 
be recovered only at the level of an ordered metric. Second, 
the hypothesis implies that the variables are independent, that 
they do not interact when combined in an alternative. By this 
is meant that a measure on any one of the variables U(P), U(S), 
and ip is independent of any measure on each of the other two 
variables with which it is associated in an offer. For example, 
an object A constituting a prize is assumed to have a measure, 
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U(P A ), which is independent of the probability of winning it and 
independent of the stake involved. It is this assumption which is 
implicit in the determination of the 0, U(P), and U(S) scales 
as independent variables. The measures of U(P) and of U(S) 
are obtained in a situation in which ^ is implicitly constant; the 
measures of psychological probability are obtained without men- 
tion of U(P) or U(S). However, the parameters x, y, z, and w 
allow the individual to make his own "idiosyncratic weightings" 
of i/>, U(P), and U(S) in arriving at an expected utility. The ef- 
fect of these parameters is, of course, to carry through an ex- 
ponential transformation on the axes as measured. 

Thirdly, in the general case, it would be assumed that the 
values x, y, z, and w were constant for this individual over at 
least the set of alternatives under study. Lastly, the theory as- 
sumes that the individual does combine ^, U(P), and U(S) into 
an E (the psychological value of an offer, or the utility of an 
outcome), and chooses on a basis of maximizing E. If the E 
of one offer equals the E of a second offer, the individual is 
said to be "indifferent" between the two offers; that is, he will 
not care which offer he chooses. Behaviorally, indifference will 
appear as a frequency of choice not significantly different from 
50% in repeated choices between alternatives 1 and 2. (This is 
discussed in greater detail in [8], page 374.) 

If it were possible to obtain measures of the three variables 
^, U(P), and U(S), and the parameters x, y, z, and w which 
were elements of ratio scales, the hypothesis would be simple 
to test. Since one cannot obtain even interval scale values for 
psychological magnitudes without strong assumptions, it is 
necessary to see what properties of the model can be made the 
subject of hypotheses testable at the level of an ordered metric 
on the variables. 

One way of visualizing the relations between */>, U(P), and 
U(S) is to regard them as orthogonal axes of a Euclidean space. 
Each alternative can then be represented by a point (0, U(P), 
U(S) ) in this space. The equation for E then defines for any 
value of E a three dimensional surface. (If 0L is not assumed 
to equal 1 - ^v, it is of course necessary to deal with a four 
dimensional model.) These surfaces over which E is constant 
will be called "indifference surfaces," since by definition the 
individual does not care which of two alternatives is chosen if 
their E are equal. 

The relation between ^ U(P), and U(S) will be clarified by 
examining successively the relation between each pair of them 
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with the third held constant. First, the relation between ^ and 
U(P) will be examined with U(S) held constant. 

B. Stake Constant. Holding stake, U(S), constant is equiva- 
lent, geometrically, to passing a plane through the three dimen- 
sional space perpendicular to the stake axis. These planes, in 
which U(S) = K, will be called "stake planes." In these, 

E = i/>U(P) - (1 - i//)K 

1. Zero Stake. If U(S) = 0, then E = ^U(P). Geometrically, 
this is a plane which passes through the origin and is the locus 
of all offers in which the individual puts up something which has 
no utility, positive or negative, for him. This zero stake plane 
cuts the indifference surfaces in a family of truncated hyper- 
bolas which describe those combinations of ^ and U(P) which 
make up offers between which the individual is indifferent. The 
indifference curves sketched in Figure 1 summarize* the fact 
that for positive prizes the individual is indifferent between a 
small prize with a high ip of winning and a larger prize with a 
smaller i/> of winning. For negative prizes (U(P) < 0), he will 
be indifferent between a small negative prize with a high ^ and 
a large negative prize with low ^. For a prize U(P) = 0, the 
indifference curve will be vertical, since if the individual never 
wins or loses anything, the odds do not matter. This vertical 
indifference curve will be called the "null line." 



s 




-U(P) 




Fig. 1. 
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The theory implies that indifference curves other than the 
null line will be convex. This means that the slope of a given 
curve increases monotonically with increasing distance from 
U(P) = 0. The slope of an indifference curve indicates at that 
point the "rate of exchange" between psychological probability 
and utility for prize. An indifference surface or curve may be 
identified (labelled) by the value E associated with all points 
on it. When = 1, E = U(P). (See Figure 2.) 




Fig. 2. 



Note that in this case the general form of indifference curves 
is not dependent on the assumption for the parameters that x = 
y = 1. For when E = X U(P) V , x > 0, y > 0, the indifference 
curve will certainly be convex. 

2. Positive Stake. If U(S) = K, K > 0, then 
E = 0U(P) - (1 - 0)K. 

In addition to a family of truncated hyperbolas, there will exist a 
vertical indifference curve (null line) located at U(P) = E = -K, 
and the indifference curves in the plane U(S) = K will have the 
form shown in Figure 3. 

This vertical indifference curve occurs when the prize to be 
acquired has a negative utility equal to the utility of the stake 
to be lost. In this case, the individual obviously will not care 
whether he wins or loses, since whichever happens he has to 
give up the same amount of utility. Consequently, he will not 
care what he is offered. 
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-K 



+ U(P) 



Fig. 3. 



3. Negative Stake, When U(S) = K, K < 0, by the assump- 
tion of equality of negative prize and positive stake or of positive 
prize and negative stake (equation (5) ) the situation is a trans- 
lation of that for positive stake. The null line will be at 

U(P) = E = -K. 

4. Some General Properties of Stake Planes. By hypothesis, 
an individual's preference among any offers will depend upon 
their psychological expected values, E, i.e., upon the relations 
of the indifference surfaces (here, curves) in which those offers 
lie. Offer 1 (^, U(PI)) will be preferred to offer 2 (^2, U(P2)) 
if and only if the indifference curve through (^j, U(PI )) lies "to 
the right of" or "above" the indifference curve through ft/*j,U(P 2 ))- 
This is the information assumed to be contained in the data when 
the Method of Choice is used to observe behavior. 

In general, when #= 1, E = U(P). The indifference curve 
which contains the offer ^ = 1, U(P) = is the locus of all of- 
fers for which E = 0, i.e., all offers in which the individual's 
expected outcome is zero. This curve divides the plane into a 
"positive expected outcome" and a "negative expected outcome" 
portion. 

That portion of the plane for which E > is presumably the 
locus of all offers which the individual will wish to take when the 
alternative is not to play, since refusing to play is accepting the 
alternative (tf> = 0, U(P) = 0, U(S) = 0). In the general case, the 
E = surface partitions the space into those offers the individual 
will take, and those he will refuse if he is permitted to do so. 
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The portion of the space (or plane) for which E > we term 
the "play region" and its complement the "no play region." 
These are the two regions which are distinguished when the 
Method of Single Stimuli is used to observe behavior. The 
E = surface is called the boundrry-of-play surface. 

The vertical indifference curves mentioned previously in 
connection with the stake planes are the intersections of the 
stake planes with a vertical surface through the origin called 
the "null surface." The null surface contains all points for 
which U(P)y = U(S) W 0. If y = w the null surface becomes a 
null plane as indicated in Figure 4. This plane intersects a 
horizontal plane in a line for which U(P) = U(S)0. 



MULL \. 




Fig. 4. 



The indifference curves through points on a line parallel to 
the null plane, $ constant, have the same slope. This can be 
seen by noting that if we take a given indifference curve through 
(01, U(PI), U(Si)) and consider the curve through 

U(Pi)-C, U(Si) + C). 



Let the parameters of these two curves (cf. equation (4) ) be 
E and E 1 respectively, then we have : 

E' = </>i(U(Pi) + C) - (1 - 0i)(U(Si) + C) 



or 



E' = 0lU(Pi) - (1 - 01) U(Si) - C . 
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Hence, E' = E - C, and the rate of exchange of ^ and U(P) is 
unchanged. Since it is the relation of ^ and U(P) which defines 
the shape of the curves in stake planes, the indifference curves 
related by U(P) 1 = U(P) - C, U(S) f = U(S) + C are all of the 
same shape. It is clear that if U(S) is increased, the slope of 
the indifference curve through a given i// and U(P) becomes 
less. 

If x and z are not both equal to 1, the shapes of the indif- 
ference curves will change and certain indifference curves will 
now possess points of inflection. An example of an inflected in- 
difference curve is shown in Figure 5. What psychological ap- 
plications such a situation might have have not been studied. 
Bizarre as such an indifference function may seem, it is worth 
noting that the existence of an inflected indifference curve is 
potentially determinable at the level of ordered metric data. It 
is at least conceivable that some cases of "apparently irrational 
behavior" might be explicable on some such hypothesis as x or 
z ^ 1. It would still be the case that one offer will be preferred 
to a second offer if the indifference curve Containing the first 
lies to the right of the indifference curve containing the second. 

C. Prize Constant. By holding prize, U(P), constant, planes 
perpendicular to the U(P) axis are obtained. Indifference curves 
in these "prize planes" will be truncated hyperbolas similar to 
those in stake planes, save that the preference is assumed to be 
for smaller stakes and for higher probabilities. The effect of 
this is to "invert" the generalized hyperbolas in the sense of 
Figures 6, 7, 8. 

1. Zero Prize. (U(P) = 0). In the case that U(S) > 0, the 
individual cannot win anything, and he will prefer to put up 
smaller stakes, and will prefer smaller probabilities of losing 
his stake. Since 0L = 1 - 0^, he will prefer therefore larger 
i/>W = & K U(S) < 0, implying that getting rid of his stake has 
positive utility for him, he will prefer larger negative stakes to 
smaller negative stakes, and of course larger probabilities of 
losing the stake. The relations are shown in Figure 6. Since in 
this plane E = (1 - ^) Z U(S) W the preference is for the offer 
whose indifference curve is to the right of the curves for other 
alternatives in the set. 

2. Positive Prize. When U(P) = K, K > 0, and hence 

E = J//K - (1 - ^)U(S), we have an inverse of the situation for a 
positive stake place. When -U(S) = U(P) the probability of win- 
ning is irrelevant and there exists a vertical indifference curve 
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Fig. 5. 
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Fig. 6. 

(null curve), as in Figure 7, which is the intersection of the 
prize plane with the null plane previously defined. As with the 
zero prize plane, one offer is preferred to a second offer if 
the indifference curve through the first lies to the right of the 
indifference curve through the second. 




Fig. 7. 



3. Negative Prize. (U(S) < 0). This situation is a transla- 
tion of that for positive prize, with the null line and the indif- 
ference curves displaced to the left of the origin, as indicated 
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in Figure 8. The null line is located at U(P) = -U(S) = E. Pref- 
erence relations remain as before. 




Fig. 8. 

D. Probability Constant. Setting = K yields a plane nor- 
mal to the 0-axis, with ifj constant throughout, which will be 
referred to as a probability plane. For 0=1 (the certainty 
plane), the player is indifferent as between stakes he's going 
to win anyway. His preference is strictly for a larger prize; 
hence, the indifference curves are straight lines, parallel to the 
U(S) axis. For = (the certain-loss plane), the player is in- 
different as between prizes; he can't possibly win. His prefer- 
ence is for the smallest possible stake; hence, the indifference 
curves are straight lines, parallel to the U(P) axis. The two 
situations are represented by Figure 9. 

It is assumed that the transition from the certain- win plane 
to the certain- loss plane is continuous. The nature of this 
transition is described as follows. (We are indebted to Charles 
H. Hubbell for this development.) 

The player is indifferent as between all situations having the 
same expectation of gain: 

E = 0U(P) - (1 - 0)U(S) 
Solving for prize and stake: 



U(P) = 
U(S) = 



U(P) 



E 



ON DECISION-MAKING UNDER UNCERTAINTY 273 



-UP) 




* +U(p) 



U(s) 



Fig. 9. 



For fixed (i.e., for a given probability plane), these are 
straight lines. In particular, the line through the origin (E = 0) 
constitutes a boundary between willingness to play (E > 0) and 
reluctance to play (E < 0). All indifference lines in the same 
plane are parallel to this, having the same slope. Regardless 
of the value of 0, the boundary- of -play line passes through the 
point (U(P) - 0, U(S) = 0). 

If is allowed to vary, this boundary-of-play line rotates, 
as shown in Figure 10, where (+) and (-), for each boundary- 
of-play line, indicate M play Tt and "no play M regions. This rotat- 
ing line moves from 0=1 to = 0, as it rotates. In so doing, 
it sweeps out the indifference surface for E = 0. At = 1, it 
is coincident with the projection of the U(S) axis, and at = 0, 
with the U(P) axis. This is shaped something like a circular 
staircase, as in Figures 11 and 12. For E = 0, it is the 
boundary-of-play surface. 

If E 0, then (U(P) = E, U(S) = -E) is the axis of rotation. 
All such axes lie in the null plane, all points of which satisfy the 
expectation equation: 

E = 0U(P) - (1 - 0)U(S) 
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Fig. 10. 




Fig. 11. 




Fig. 12. 
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as was indicated above. 

The generation of each indifference surface is exactly simi- 
lar to that for the boundary- of -play surface, with U(P) = E, 
U(S) = -E as the axis of rotation. Hence, all indifference sur- 
faces are similar. Since each is completely described by the 
regular motion of a straight line (the generator), they are 
known, mathematically, as ruled surfaces. 

These surfaces actually have two sets of rulings. The other 
ruling, for the boundary- of -play surface, has as its axis of ro- 
tation, a line in the probability plane </> = 1/2, normal to the 
null plane. The generating line is at all times perpendicular to 
the axis, and rotates in such a way that it always passes through 
the U(P) axis and through the projection of the U(S) axis on the 
certainty-plane. Its intercepts on those lines are equal, and of 
like sign. For U(P) = U(S) = 0, the generator coincides with 
the ^ axis. (See Figure 13.) 




t# 



Fig. 13. 
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V. SOME EXPERIMENTAL DATA 



A. An Illustrative Pilot Study, A pilot experiment was con- 
ducted during the summer of 1952 (This experiment was con- 
ducted by David Beardslee and Stefan Vail. The subject was one 
of the secretaries for the project.) to see if some of the things 
predicted by the model happened, and to serve as an illustration 
of the method. For stimuli for the U(P) scale, five objects 
were selected which were pictured in the advertising pages of 
a newspaper. These were to be the "prizes" in an imaginary 
lottery. These stimuli were as follows: 



a - a rattan chair, 

b - an electric broiler, 

c - a typewriter, 

d - a radio, 

e - a portable phonograph. 

The pictures, of course, provided the subject with a concrete 
stimulus which was perhaps more susceptible to constant valua- 
tion than the names of the objects are. These stimuli were pre- 
sented by the Method of Similarities [3] and the subject's U(P) 
scale obtained. The obtained I scales are given in Table 1. 



Table 1. 



Stimulus at 


Chair 


Broiler 


Typewriter 


Radio 


Phonograph 


Head of 












I Scale 


a 


b 


c 


d 


e 




a 


b 


c 


d 


e 


Reduction of 


b 










Paired Compari- 












son Judgments 


c 


d 


d 


e 


d 


to a Rank Order 
I Scale 


d 


e 


b 


b 


b 




e 


a 


a 


a 


a 



It is to be noted that these five I scales do not satisfy the 
same simple order. Three are consistent with the ordering of 
the stimuli a b c d e from low to high in utility, and two are 
consistent with the ordering a b d c e. 

At the end of the experiment, however, the subject was asked 
to place the stimuli in rank order and the obtained rank order 
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was abode. Also, the data of a later experiment in which 
these stimuli served as components of uncertain offers are com- 
patible with the hypothesis that the proper rank order is a b c 
d e. 

The data also contained the metric information that 

ab >be 
cB>de 

that is, that the difference in utility U(P) between stimuli a and 
b was greater than the difference in U(P) between b and e; and 
the difference in U(P) between c and d was greater than the 
difference between d and e. 

Time did not permit the determination in this pilot experi- 
ment of a scale on some probability stimuli, so explicit objec- 
tive probabilities were used in constructing offers, and it was 
assumed that ^ = objective probability. The probabilities used 
were .10, .30, .40, .60, and .70. This ^ scale in combination 
with the U(P) scale above defines 25 offers in the product 
space, of which eight were selected for study. Two additional 
offers were made up by combining "both a and b" as prizes 
with two of the probabilities. 

These 10 points are shown in Figure 14, which represents a 
stake plane. It must be kept in mind that only the order of 
stimuli on the two axes and certain metric information is actu- 
ally known; although the illustration is drawn as though the 
points had known numerical values, such is not the case. The 
stimuli F and I in the space have been located ordinally on the 
U(P) axis from metric information contained in the data. They 
were not located on the U(P) scale empirically as they might 
have been. 

The offers were presented to the subject for judgment by the 
Method of Triads. The responses of the subject are given in 
Table 2. This tabulation was arrived at as follows. The subject 
was presented with every possible triad, of which there were 

(3 ) = 120, and instructed to judge in each triad which "chance" 
she would most prefer to take and which least. The game was 
free, the subject had no stake, i.e., U(S) = 0. 

For example, E, H, and I constituted one such triad. In this 
triad the subject said she preferred E the most and I the least. 
This was tabulated as three paired comparison judgments, 
E > H, E > I, and H > I (where the symbol > signifies pre- 
ferred to). Each of these pairs occurred in eight triads, n-2, 
and hence each such paired comparison was made eight times. 
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Table 2. 



A, B 8* 


8 B, C 


5 C, E 3 


D, H 8 


F, H 8 


A, C 8 


8 B, D 


8 C, F 


7 D, I 1 


7 F, I 1 


A, D 8 


B, E 8 


7 C, G 1 


D, J 8 


F, J 8 


A, E 8 


8 B, F 


8 C, H 


8 E, F 


G, H 8 


2 A, F 6 


1 B, G 7 


8 C, I 


7 E, G 1 


8 G, I 


A, G 8 


B, H 8 


7 C, J 1 


8 E, H 


8 G, J 


A, H 8 


8 B, I 


D, E 8 


8 E, I 


8,H, I 


2 A, I 6 


2 B, J 6 


3 D, F 5 


8 E, J 


8,K, J 


A, J 8 


8 C, D 


D, C 8 


F, G 8 


I, J 8 



*To be read "no judgments of A preferred to B, eight judgments 
of B preferred to A." 



The tabulation of these eight judgments for each pair is con- 
tained in Table 2. It should be noted that there are two 
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judgments made on a triad which are here decomposed into 
three which are consequently transitive. Thus, on the average, 
there are only 2/3 of a degree of freedom associated with each 
paired comparison judgment. But the eight times each judgment 
was made accumulates these degrees of freedom so they may 
be regarded as having five and one-third degrees of freedom 
per paired comparison, assuming the successive presentations 
of triads are experimentally independent. 

Table 2 reveals that an offer to the right or above another 
offer in the product space is preferred. There were eighteen 
comparisons of such pairs. On three of these pairs the judg- 
ments were inconsistent: 7 C, G 1; 7 E, G 1; and 7 F, I 1. 
According to the theory these should all have been 8 and 0. 
The other 15 pairs were all consistent, 8 to 0, and as predicted. 

Another experiment was then run on the same subject using 
just five stimuli and the Method of Paired Comparisons but re- 
peated under conditions of a stake (imaginary) of $1, $10, $15, 
and a negative utility for stake (a studio couch which the subject 
wanted to get rid of but which would cost money to have hauled 
away). 

The stimuli used are indicated in Figure 15. At each stake 
the subject made 10 paired comparisons. In each instance the 
10 paired comparisons were transitive so the data are given 
below as a rank order in Table 3. 



Table 3. 



Stake I Scale 

$15 A B C D E 

10 B A C D E 

1 C D E B A 

Studio D E C B A 
Couch 



These I scales define certain "pieces" of indifference curves 
which are shown in Figures 16 - 19. These figures represent 
only order relations and some metric information. Innumerable 
transformations on the axes are permissible, so the shapes in- 
dicated for the indifference curves represent confidence in the 
theory rather than experimental demonstration of it. 
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Fig. 19. U(S gtudio couch> 
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B. Experimental Tests of the Shape of Indifference Curves. 
In a more extensive study, Coombs and Milholland have ob- 
tained the metrics of the */> and U(P) axes independently, and 
were able to show that there existed at least one convex indif- 
ference curve for one of their subjects [4]. In another study [6], 
Fliege found additional evidence of the existence of convex in- 
difference curves. 

C. Mosteller and Nogee's Experiment. Mosteller and Nogee 
have reported [8J an experiment which may be analyzed in 
terms of this theory. They presented their subjects with a large 
number of possible bets, and for each allowed him to choose whe- 
ther or not to accept the bet. They selected 7 odds (7 values of 
0), and for each they offered several different prizes, with stake 
constant throughout. Each opportunity to bet was offered several 
times. This is the Method of Single Stimuli mentioned above, and 
permits only location of the subject's E = indifference surface, 
i.e., the boundary- of- play surface. (Since stake is constant, they 
determined only an E = indifference curve, in the stake plane 
for U(S5^).) The general situation is shown in Figure 20. Since 
for each value of & several prizes were offered, each several 
times, they could compute for each ^ a cumulative "frequency 

of acceptance of bet" curve, and they took the prize for which 
the bet would be accepted 50% of the time as defining the E = 
indifference curve. (See Figure 21.) If there were no error, the 
cumulative frequency curve should have the shape shown in 
Figure 22. They assume that psychological probability, */>, is 
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linear with objective (and experimentally explicit) probability. 
They obtained 'numbers by assuming that utility, U(P ) = 0, and 
that utility of stake, U(S g ^) = 1. 

Then by assuming the E = indifference curve to be a trun- 
cated rectangular hyperbola (^U(P) = E = a constant), they were 
able to compute the relation of monetary value and U(P). The 
relation between these two they plotted as the utility of money. 
Further portions of the experiment dealt with preferences be- 
tween two alternative bets and the percentage of times compound 
bets would be accepted. They predicted (on the basis of the first 
portion of the experiment) the percentage of times one offer or 
the other would be chosen, or a compound bet accepted, by using 
the explicit objective probability, the experimentally determined 
U(P), and computing an "expected utility/ 1 

There is some evidence [5], [6] that \l/ is not a linear func- 
tion of the objective probability. In this case the procedure used 
by Mo s teller and Nogee no longer yields a unique solution for the 
relation of monetary value and U(P). 

D. Experiments on the Relation Between ^ and Objective 
Probability. Preston and Baratta [9J have attempted to deter- 
mine the relation between objective probability and *//. They had 
their subjects bid for prize-probability combinations which were 
then played. This is a variant of the pay-to-play situation in 
which the price to play is determined by competitive bidding. 
Since this situation involves assumptions about the comparability 
of ty and U(P) scales for different individuals, no attempt is 
made to analyze their results here. They did find that the func- 
tion relating objective probability and ty (computed from group 
data) was relatively invariant over changes of prize. A similar 
function was obtained by Griffith [7] in an analysis of horse- 
race betting. He also observed that for a large number of indi- 
viduals, offers, and times, the boundary-of-play surface approxi- 
mates the objective zero expected value. 

Edwards [5] in a study of choice between offers varying in 
i/>, U(P), and U(S) found a function relating group data on i// and 
objective probability which, like Preston and Baratta 1 s, was in- 
variant under changes in U. The nature of this function is very 
different in the two cases, however. Unpublished analyses of the 
data for individual subjects of Edwards suggests that grouping 
the data in studies of this type masks large individual differences 
in i/x and U(P) scales. 
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VI. SUMMARY 

This chapter has presented a general system designed to or- 
ganize the field of decision-making under uncertainty and to 
suggest experiments in that domain. It attempts to clarify the 
differences between certain situations, and suggests explana- 
tions for differing behaviors. A number of experiments in the 
field of decision-making under uncertainty have been analyzed 
in terms of this theory. 
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CHAPTER XVIH 

ENVIRONMENTAL NON-STATIONARITY IN A 
SEQUENTIAL DECISION-MAKING EXPERIMENT 

by 

Merrill M. Flood* 
COLUMBIA UNIVERSITY 

1. PROBLEM 

The pilot experiment reported here was stimulated in part by 
some results discussed by W. K. Estes. One such result is the 
following: In certain two-choice situations, where the reward 
probabilities are it\ and 7T2, some organisms will tend eventu- 
ally to choose the ith alternative with probability p^, given by 









Where 



For example, Estes (DP) referred to a case in which the reward 
probabilities were TTJ =0.5 and 712 = 0, and the average p^ of 
several human subjects was observed experimentally to be 0.67. 
Estes stated that formula (1) fitted all his data well, when aver- 
aged over enough subjects and trials, and remarked that it was 
derived from a system of mathematical axioms forming a basis 
for learning theory. 

After Estes had presented his paper, various game-theorists 
in the audience argued (incorrectly) that this observed behavior 
was surprisingly "irrational," since the organism's proper 
strategy is clearly pure, rather than mixed, and so the organism 



*This work was started while the author was with The RAND 
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should learn eventually to choose only the alternative providing 
the more frequent reward. The present writer countered this 
game-theoretical argument with the following two objections: 

(a) The proper definition of payoff utilities would be unclear 
in attempts to apply game-theorttic arguments to a real case, 
and there is a reasonable payoff matrix that would rationalize 
the reported behavior. Thus, if the organism's object were to 
maximize its score rather than its expectation, then it should 
sometimes not tend to use a pure strategy. For example, one 
intelligent subject in a punchboard experiment conducted by the 
author (1952) remarked that her only hope to get a "perfect" 
score in a (90,10) experiment would be based on playing 90 in 
the 90-case and 10 in the 10-case and being lucky on each play. 
Hence, if absolute perfection were assigned sufficiently high 
utility, her optimal behavior, even in the game- theoretic sense, 
would not be to always choose the same case. Of course, this 
same argument might suggest that she should use a single-choice 
strategy in a (50,0) experiment, but only if she is thoroughly con- 
vinced that the other response will always be non- rewarded. At 
any rate, a defense of the mixed-choice behavior can be made 
along these general lines. 

(b) The von Neumann-Morgenstern game theory is inapplicable 
in this situation unless the organism can assume safely that the 
experimental stimulus is generated by a stationary stochastic 
process. For example, if the organism believes that there may 
be some pattern (non-stationarity) over time, in the stimulus, 
then it can often do better by using a mixed strategy rather than 

a pure one, for the latter would give it no way to discover any 
pattern effect. At present, mathematical game theory and sta- 
tistical decision theory provide no generally accepted prescrip- 
tion of optimal behavior in the non- stationary case; indeed, it is 
quite conceivable that organic behavior will be found to repre- 
sent such a solution if and when it is understood. In response to 
our query on this point, Estes paraphrased the instructions given 
to his subjects. It seemed significant that he had not tried to sug- 
gest to them that they were, in fact, being confronted with a sta- 
tionary process. In the author's experiments, on the other hand, 
we had emphasized very strongly the exact nature of the station- 
ary process (see Appendix A) confronting the subjects, 2 and it 
had seemed to us that this fact had helped the subjects go to pure 
strategies. We conjectured, therefore, that the pure type of be- 
havior would be found in subjects who were convinced of station- 
arity and that the mixed type of behavior would be found sometimes 
in subjects who believed that there might be non-stationarity. 
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These issues were much discussed during the Santa Monica 
Conference, and there seemed to be agreement that the matter 
was worthy of further experimental investigation. 

R. R. Bush (DP) offered a theoretical explanation of the pure 
and mixed cases by showing how the Estes formula (1) is ob- 
tained by assigning certain very special values to the parameters 
in the Bush-Mosteller [l] mathematical learning model. Bush 
also derived an asymptotic mixture formula for the r-choice 
case, and, with G. L. Thompson (DP), has remarked on some of 
the pertinent experimental questions. The Bush r-choice for- 
mula is 

1 
(2 ) Pi = _lHL_ for (1 = 1,2,- -, r) 




where p^ is the eventual probability of the organism choosing 
the ith alternative, and TTJ is the probability that the jth alter- 
native will be rewarded. Formula (2) was derived from the Bush- 
Mosteller model in [l] on the assumption that if the ith alterna- 
tive is always rewarded (non- re warded) then pj = 1 (0). 

The punchboard experiments to be discussed here include in- 
stances in which r = 2 and r = 9, so they provide some data that 
are relevant to the question concerning the applicability of the 
pure and mixed models discussed by Bush and Thompson (DP). 
In particular, we present the results of a pilot experiment carried 
out in an effort to devise a test of the hypothesis that behavior of 
human subjects is mixed or pure according as they are or are not 
convinced of non-stationarity in a situation that is actually sta- 
tionary. 



2. EXPERIMENT 

The equipment and procedures used were explained in written 
instructions given to the subjects before the trials began. The 
set of instructions used in the initial 9-choice experiment, where 
stationarity was stressed, are summarized in Appendix A. The 
set used in the 2-choice experiment, where stationarity was not 
mentioned, is indicated in Appendix B. The reader is referred 
to these two Appendices for information about the equipment and 
procedure used, and familiarity with this material is now as- 
sumed. 
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In the 9-choice case, the reward probabilities were each 
chosen at random on the interval (0,1). In the 2-choice case, the 
ten pairs of reward probabilities were chosen arbitrarily so as 
to provide information on pairs scattered rather evenly among 
all possibilities. The equipment was identical in the two cases, 
and even the same codes were used, except that the subject was 
limited in the 2-choice case to choices between a pair of col- 
umns specified at the top of each code. The actual values for 
these pairs are indicated in the following table.- 



Code No. = 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


100 7TJ = 


51 


76 


55 


40 


80 


56 


98 


79 


92 


96 


100 7T 2 = 


24 


09 


09 





30 


44 


49 


59 


69 


17 



In the 9-choice case there were 10 subjects and each did the 
same 10 punchboards with 100 choices on each. In the 2- 
choice case there were two subjects, one of whom had previous- 
ly done one punchboard following the instructions of Appendix A. 
All subjects were unfamiliar with game -theoretic notions. 



3. HYPOTHESIS 

Data are rather meaningless, of course, except as they are 
interpreted with respect to some well-defined hypothesis. In 
this instance, the hypothesis is: Behavior is asymptotically 
pure or mixed according as subjects are or are not convinced 
of non-stationarity. As one way to begin testing this general 
notion, a comparison is made of the frequencies of later choices, 
in each response class, with the asymptotic frequencies that are 
predicted theoretically by the Estes-Bush formula (2) for p 4 in 
the mixed model. 

There is a real difficulty in knowing what 'later choices' 1 
should mean in this context. For example, if ir 2 = 1, then the 
mixed model predicts that P2 = 1 whenever TTJ < 1, even though 
a subject would probably show very different behavior over a 
large sample of early trials according as TTJ is close to zero or 
close to unity. Furthermore, it is quite possible that subjects 
may behave according to the mixed model for a certain period 
at the beginning and switch to the pure model after their early 
experience convinces them that the process is truly stationary. 
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We shall be interested in comparing three quantities, com- 
puted from the observations, with the theoretical value P = p z 
corresponding to the column z for which TT Z = max 7^. First, 

we let F denote the relative frequency of choice of column z 
during trials x through y. Next, we let c denote the number of 
the last trial in which column z was not chosen. Finally, we 
consider F}, Ff , and FjJ as estimates of P. 

The use of Fj as an estimate is justified on the ground that 
the subject can not have gone to a pure strategy before trial c+1, 
and so F for x < c represents the relative frequency of choos- 
ing column z just prior to the change to a pure strategy if this 
change did in fact occur after trial c. Of course, F has a 
downward bias due to the possibility that the change to a pure 
strategy may in fact have occurred several trials after trial c. 
Since P is supposed to refer to asymptotic behavior, we should 
prefer to choose x large in calculating F but this has not been 
done here simply because the condition x < c is often incom- 
patible with the requirement that c - x be large enough to pro- 
vide a suitable sample size for the calculation of F. 

It might be possible to make observations more directly con- 
cerning the subject's state of belief with regard to stationarity, 
perhaps even by asking him after each trial. There is the ob- 
vious danger that such queries might alter the experimental 
stimulus and thus affect the responses. Some of this danger 
could be avoided, conceivably, by asking all questions at the 
close of the work by each subject, and this has sometimes been 
done. We hope to approach this question more directly in future 
experiments. 



4, RESULTS 

The data for the two-choice case are summarized in Table 1. 
The choices of subject FD do not seem clearly explicable in 
terms of the mixed model. For example, all of his choices on 
his last board (Code 6) were made on a column that gave wins 
on only 56% of the trials. One possible explanation of this be- 
havior is that he set a standard of 50% wins as satisfactory and 
then did not search for a better column as long as he felt that 
this standard was met, as was the case on this last board after 
the sixth trial. A fact consistent with this explanation is that of 
his continued effort (42 attempts through trial 86) on his first 
board (Code 4) to win on a column that never paid off while the 
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Table 1 
Stationarity Unspecified (2-Choice Case) 



Code No. = 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


100 P = 


61 


79 


66 


63 


78 


56 


96 


66 


80 


95 


Subject MS 






















100 F}= 


54 


57 


77 


33 


70 


63 


100 


66 


93 


97 


100 Fj = 


39 


48 


76 


33 


68 


61 


~ 


66 


82 


92 


c = 


79 


83 


94 


100 


95 


95 


~ 


100 


38 


36 


Subject FD 






















100 F} = 


40 


96 


75 


58 


72 


100 


100 


97 


100 


89 


100 Ff = 


40 








51 


18 


-- 





94 








c = 


100 


4 


25 


86 


34 


-- 





52 


._ 


11 



better column was paying off less than half of the time. These 
possibilities are typical of the experimental complications that 
arise because of the particular psychological set of the subject 
at the start. 

Subject MS, on the other hand, seems to have performed in a 
manner quite consistent with the mixed model, as judged by the 
close agreement between P and FjM. His seventh board (Code 
4) is a notable exception, however, and it is a surprising fact, in 
this connection, that he devoted his last 28 trials to a choice that 
always failed and that had always failed on his previous 39 trials 
with it. It is significant that he kept trying both columns well 
through all but two of his boards, and that he remarked during 
and after the experiment that he was convinced that there was 
some sort of pattern and that he might find it if he kept on hunt- 
ing for such regularities. If it is supposed that this hunting ended 
soon after trial c, then one would expect that Fj might be in 
even better agreement with P than is F|. This seems to be 
true for subject MS, except for his ninth board (Code 1) and is 
further evidence of consistency between his behavior and the 
hypothesis that the mixed model applies until the subject believes 
that the process is stationary. 
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Table 2 
Stationarity Specified (9-Choice Case) 
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Code 


100 P 


100 F} for Subject: 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


1 


57 


90 


81 


26 


14 


77 


74 


96 


100 


12 


75 


2 


55 


81 


86 


22 











96 





16 





3 


41 


68 


83 


30 








68 


12 


85 





61 


4 


100 


97 


78 


54 


100 


57 


100 


100 





6 


88 


5 


35 


50 





14 


7 


88 








25 


5 


67 


6 


29 


4 


40 


28 


23 


15 


80 


50 


85 


12 


64 


7 


56 


97 





14 


1 





92 





48 


36 


3 


8 


55 


100 


100 


14 





100 





11 





30 





9 


57 


46 


100 


92 





100 





100 





3 


95 


10 


58 


97 





90 





83 


90 


98 


100 


15 


24 



Code 


100 P 


100 F 5 J for Subject: 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


1 


57 


86 


100 


52 


14 


100 


98 


96 


100 


12 


74 


2 


55 


98 


82 


44 











94 





12 





3 


41 


84 


98 


60 








100 


6 


100 





100 


4 


100 


100 


100 


100 


100 


100 


100 


100 





6 


100 


5 


35 


64 





8 


14 


100 








30 


10 


68 


6 


29 





50 


46 





16 


100 


96 


100 


12 


100 


7 


56 


100 





8 


2 





100 





48 





6 


8 


55 


100 


100 


14 





100 





14 





52 





9 


57 


68 


100 


86 





100 





100 





6 


94 


10 


58 


100 





100 





100 


100 


100 


100 


24 


44 
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The data on subjects 1 - 10 in Table 2 are rather difficult to 
interpret in the light of this central hypothesis. Certainly, there 
is no very striking agreement between P and Fj. It does seem 
to be the usual case that the subject is still choosing columns 
other than the best one through most of the 100 trials in each 
game, and the subjects usually fail to settle down on the very 
best choice within their hundred trials. Of course, it may be 
argued reasonably that the early trials in each game should not 
be taken too seriously in an investigation of asymptotic behavior. 
For this reason, the F^f data are included, but here too the in- 
terpretation is difficult. 

One source of trouble in the analysis of the 9-choice case is 
the similarity between the TTJ, in some instances, that makes it 
difficult for the subject to discriminate between such similar 
columns. For example, in Code 7, the three largest values of 
100 TTj are 98, 95, and 90, so that it would not be at all surpris- 
ing if a subject were to spend some of his choices on the 95% 
column that the mixed model would allocate to the 98% and 90% 
columns. Consequently, in an effort to mitigate this confusing 
effect, the data were regrouped for analysis by combining col- 
umns having comparable TTI values. The (arbitrary) rule for 
this regrouping consisted of combining all data for those columns 
whose TTi values differed by 7% or less from the max TTJ into a 
new column 1, then similarly forming a new column 2 including 
all those columns whose n^ values differed by 7% or less from 
the max TTJ not already included in the new column 1, and con- 
tinuing until all the columns were regrouped. If r^ denotes the 
number of new columns formed for Code j, and TT^ denotes the 
mean of the set of TTJ values represented in the new column k of 
Code j, for k = 1, 2, , r^, then, in accordance with the mixed 
model, the asymptotic frequencies of play p? (on new column k 
for Code j) is given by the following expression: 
1 

= _lrB for (j = 1, 2, . -, 10; k = 1, 2, . - , rl). 



Table 3 presents a comparison of the theoretical values of 
pj, as obtained from (3), with the relative frequency of choice 
of new column 1, in all 100 trials and in the last 50 trials, re- 
spectively. The average relative frequencies were computed for 
all but Subject 9, whose data have been omitted because his se- 
lections differed very greatly from those of the other subjects. 
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Table 3 
Stationarity Specified (Grouped 9-Choice Case) 
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It would be difficult to argue that the data in Table 3 confirm 
the hypothesis of behavior in accordance with the mixed model, 
but even more difficult to argue that they contradict it. The 
author 1 s view is that a much more extensive experiment is nec- 
essary before any real conclusions can be drawn. The most 
promising approach seems to involve beginning with a 3-choice 
case, in which the TTI are rather evenly spaced and not too close 
to or 1, in order to check on the closeness of agreement with 
the Estes-Bush formula (2) for pj. 

The author would not want to undertake the problem of deter- 
mining the precise nature of the experimental stimulus necessary 
for producing this kind of difference in behavior until experimen- 
tal techniques were good enough to give repeatable and constant 
results, for the 3-choice case, in agreement with either the pure 
or mixed models and only if this stimulus problem were under- 
stood reasonably well would he again want to work with the r- 
choice case for r > 3. It is hoped that experience in the crude 
pilot experiments reported on here maybe of some use to others 
who are interested in this problem of whether to accept the pure 
model or the mixed model of learning behavior. 



APPENDIX A 
Instructions for Static-Nine Experiment 

A. Static-Nine 

1. The equipment for static-nine consists of a "punchboard," 
a M punch, n a "code," and a "register. " The code and register 
are printed forms fastened to the back and the front, respective- 
ly, of the punchboard. The punch is used on each move to make 
a hole in the register signifying the choice of an integer from 1 
through 9. Samples of code and register are attached to these 
instructions. 

2. The register has 25 lines, and in each line the integers 1 
through 9 appear in four "fields" across the page; this provides 
for 100 moves. 

3. The code has either a 1 or a in each position. A 1 de- 
notes a win and a denotes a loss. The code is arranged so that 
the mark 1 appears a preassigned number of times in each col- 
umn. The order of the marks 1 and in each column is random. 

4. The first move is made by punching out one of the nine 
digits in row 1 of field 1. If a 1 is seen through the hole this 
position is circled in pencil by the player to denote a win, 
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otherwise it is left uncircled to denote a loss. The second move 
is made similarly by punching in row 2 of field 1, and circling 
to denote a 1 if observed. After the 25 moves in field 1 are 
completed, start at the top of field 2, and continue in this way 
until all 100 rows have been punched. The score on these 100 
moves is the total number of circled positions. 

5. After you have made 100 trials, you will give the umpire 
instructions for your plays in the next hundred trials. You do 
this by assigning to each choice, 1 through 9, a number indicating 
how often you wish that choice played in the next 100 trials. The 
nine numbers must add to 100. For example, you might assign 
20, 32, 13, 10, 25 to digits 1, 3, 7, 8, 9 respectively, and zeros 
to the others, indicating that you want digit 1 played 20 times, 
digit 2 played no times, digit 3 played 32 times, etc. In par- 
ticular, if you wish to have some one number (say 5) played all 
the time, then you would assign 100 to that one and zero to each 
of the others. These numbers should be written in the row pro- 
vided on the data form for this purpose. 

6. Your object is to get as many wins as you can in the 200 
rows. 

B. General Information 

1. This is an experiment designed to obtain a quantitative 
comparison of the ability of people with that of rats in learning 
to play a certain simple intellectual game. Bush and Mosteller, 
at Harvard, have examined experimental data obtained by psy- 
chologists in their studies of rat learning. They have developed 
a mathematical model that seems to fit the rat data quite well. 

I am using this model, which they have called the n stat-rat, ft to 
compute the probable performance of rats in playing von Neu- 
mann- Morgenstern games. The scientific purpose is to test the 
validity of various mathematical theories of learning and de- 
cision. 

2. The punchboard represents a mechanical umpire who de- 
termines wins and losses in the following manner. The umpire 
first chooses nine numbers between and 1 (for example, .500 
or .378) from a random number table; these are thought of as 
probabilities Gj, 62, , Gg that your choices of 1 through 9 
will win. Each time you play a number (say 3) the umpire deter- 
mines whether or not you have won by applying the corresponding 
probability (63 in this case) to make the decision; he again uses 
a random number table for each of those decisions. Thus, if the 
probability 63 that your choice 3 would win was .700, you might 
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expect to win 7 times out of 10 when you choose the number 3. 

3. For the second hundred trials the umpire simply multiplies 
the nine numbers written on your data form by the corresponding 
probabilities Gj, 63, , Gg, and adds these products together 
to get your total of wins on the second hundred trials. Of course, 
this computational procedure produces the same result as the 
one that you would expect to get if the umpire actually went 
through the second hundred plays for you one by one. 

4. We should like to have you play static-nine several times. 
Your average score for all your games will be compared with 
that of the stat-rat, and those of the other subjects in this ex- 
periment. You will be told the s tat- rat 1 s score after each game. 
At the conclusion of the experiment, I will send each subject a 
recapitulation of the names and scores of all players, including 
the stat-rat. 

5. Thank you for participating, and good luck. 



APPENDIX B 
Instructions 

A. Static-two 

1. As in Appendix A, paragraph Al. 

2. The register has 25 lines, and in each line the integers 1 
through 9 appear in four "fields" across the page; this provides 
for 100 moves. At the top of each register you will find two 
digits written in red; you are to punch only one or the other of 
these two on each trial. 

3. The code has either a 1 or a in each position. A 1 de- 
notes a win and a denotes a loss. 

4. The first move is made by punching out one of two digits 
in row 1 of field 1. If a 1 is seen through the hole this posi- 
tion is circled in pencil by the player to denote a win; otherwise 
it is left uncircled to denote a loss. The second move is made 
similarly by punching one of the two digits in row 2 of field 1, 
and circling to denote a 1 if observed. After the 25 moves in 
field 1 are completed, start at the top of field 2, and continue 
in this way until all 100 rows have been punched. The score on 
these 100 moves is the total number of circled positions. 

5. Your object is to get as many wins as you can in the 100 
moves. 
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B. General Information 

1. As in Appendix A, paragraph Bl. 

2. We should like to have you play static-nine several times. 
Your average score for all your games will be compared with 
that of the stat-rat, and those of the other subjects in this ex- 
periment. At the conclusion of the experiment, I will be glad 
to give you a recapitulation of the names and scores of other 
players, including the stat-rat, if you wish. 

3. Thank you for participating, and good luck. 

FOOTNOTES 

1. The present volume, hereinafter referred to as (DP), includes 
a paper by Estes reporting on these results. 

2. Even though we stressed this point greatly, it was not really 
accepted by two colleagues who, as subjects in one experiment, 
made inferences from the illustrative example in the written 
instructions (and from apparent patterns of successes and 
failures in early trials) that cannot be justified on logical 
grounds if the assumption of stationarity were really accepted 
by them I 

3. Samples were shown to each subject but are not included here. 
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CHAPTER XIX 

SOME EXPERIMENTAL n-PERSON GAMES 

by 
G. K. Kalisch, J. W. Milnor, J. F. Nash, E. D. Nering 

UNIVERSITY OF MINNESOTA, PRINCETON UNIVERSITY, 
MASSACHUSETTS INSTITUTE OF TECHNOLOGY, 
UNIVERSITY OF MINNESOTA. 

INTRODUCTION 

This paper reports on a series of experiments designed to 
shed light on some of the concepts important in the theory of 
n-person games. Interest was mainly in games of cooperation, 
and, in particular, the steps which lead to an agreement to co- 
operate. Thus the mechanics of bargaining, negotiation, and co- 
alition formation were important features of these games. Most 
of the experimental games were formally of the type considered 
by von Neumann and Morgenstern [4]. They and others, (see 
Sections I. 6 and II and items [l], [2] in the bibliography) have 
defined various theoretical concepts for such games. Our prin- 
cipal aim was to compare these concepts with the results of 
actual plays. 

In addition, the bargaining process itself proved to be Inter- 
esting. Discussion of various details of this process as it oc- 
curred in our experiments is included not only to provide the 
context in which the results should be evaluated, but to enable 
future designers of such experiments to profit from our experi- 
ence. In particular, it was interesting to observe that personal- 
ity differences played a very important role in determining a 
subject's measure of success in playing these games. The sub- 
jects were four men and four women, including five college stu- 
dents, two housewives, and one teacher. They were a remarka- 
bly intelligent and cooperative group. Certainly all understood 
the rules and could analyse their positions as well as could be 
expected of persons not specifically trained in such negotiations. 
The subjects were given chips to use in making the payments in- 
volved in playing the games, and at the end of each of the two 
days on which we ran experiments, we redeemed the chips for 
money. The players who ranked third and fifth on the first day 
moved up to first and third places, respectively, and otherwise 
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the relative ranking was unchanged. That this was due almost 
entirely to personality differences was apparent to the observ- 
ers. This conclusion may also be inferred from the discussion 
in section 1.3. 

Some of the games had features which distinguished them 
from games of the von Neumann and Morgenstern type [4]. One 
experiment concerned a game in which side-payments were not 
permissible. In others, the negotiation procedures were formal- 
ized (e.g., the identities of a player's opponents were concealed 
from him and he was allowed to bid, accept, decline, or counter- 
bid in a very limited number of ways through the intermediation 
of a referee). The construction of a theory to deal with an un- 
limited or very large number of negotiation possibilities is as 
yet so difficult that it seems desirable to restrict and severely 
formalize the negotiation procedure to the point where a mean- 
ingful theory can be constructed. These experiments were es- 
sentially pilot experiments and served to test the workability of 
formal negotiation models. 

In general, the authors think that these experiments were 
fruitful and indicate that further experiments along these lines 
are feasible and should prove valuable for the further develop- 
ment of game theory. The field of n-person games has had very 
little empirical investigation; for this reason, and because of the 
relatively undeveloped status of the theory, the authors feel that 
the use of the experimental approach is strongly indicated. 



I. COOPERATIVE GAMES WITH SIDE PAYMENTS 

1. Description of Games. Six constant-sum games, of the 
type studied by von Neumann and Morgenstern, [4] (that is, co- 
operative games with side payments allowed), were played by 
our experimental subjects. Four of these were four-person 
games which were played eight times each. A five-person game 
was played three times and a seven-person game was played 
twice. These games were presented simply by giving their char- 
acteristic functions. The players were rotated after each play, 
to discourage permanent coalitions. Perhaps the best way to 
describe these games is to give a digest of the material which 
was given to the subjects. This is done in the following. Note 
that games 1 and 4 are strategically equivalent and that games 
2 and 3 are strategically equivalent. Game 3 is just the sym- 
metric four-person game. The five- and seven-person games 
are taken from examples in [4]. 
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Instructions to the Subjects, 

a) General Instructions, Approximately one typewritten page 
of general instructions was issued before the subjects started 
the experiments proper. 

First, the subjects were acquainted with the general aim of 
the experiments ("to further the experimenters' understanding 
of the Theory of Games*); the system of rotation alluded to 
above was explained. Next, the possibility of forming coalitions 
was explained. A coalition was stated to have been formed 
whenever a set of players had decided to act in concert and had 
decided how to distribute among its members its common gain 
or loss. Finally, this part of the instructions sought to empha- 
size the aspects of aggressiveness and selfishness the players 
were to exhibit during the experiment; they were exhorted to act 
only on the merits of the game situation as such (their ultimate 
objective being the maximization of their gain) - without refer- 
ence to personal preferences or outcomes of previous runs. 
They were told that modest monetary rewards were to be dis- 
tributed at the end of the experiment in proportion to the totals 
of points (represented by chips which was the universal mode of 
exchange during the experiment) earned during their ludal ac- 
tivities. 

b) Specific Instructions. The "characteristic function* was 
explained to the players as the rule determining the total pay- 
ment to each coalition in existence at the end of the game. Fur- 
thermore, the characteristic function was given to them diagram- 
matically (see Figure 1, game # 1, for sample). The players 
were told that their objective was to form "final coalition agree- 
ments* which would determine both a set of players, and the dis- 
tribution of any gain or loss accruing to the coalition. These 
final coalition agreements were then to be communicated to an 
umpire who recorded them, read them back, and ascertained 
that they reflected the will of the group. The latter provision 
was necessary, particularly in the case of the seven-person 
game. This type of formal agreement was binding on the players 
(enforced by the umpire) but was to be preceded by informal and 
tentative bargaining which was not binding. In addition to the 
formal "final coalition agreement* there also were possibilities 
of various well defined kinds of formal and binding intermediate 
agreements which again could be preceded by informal bargain- 
ing, and which were communicated to the umpire who recorded 
them and enforced their terms. 
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Game 6 

Number of Players Payoff to 

in a Coalition the Coalition 

1 -40 



3 -20 

4 20 

5 

6 40 

Table I - Coalitions are indicated by the letters of the players 
they contain. The entries indicate the value of the 
characteristic function of the coalition on the right in 
the game on the top. Games #1-4 are four -per son 
games; game #5 is a five-person game; game #6 is a 
symmetric seven-person game. 



A run was over at the end of 10 minutes or whenever there 
was no further possibility or desire to form further coalitions, 
if that occurred before 10 minutes had elapsed. The players 
were permitted to include in their coalition agreements the al- 
lotment of payments to outsiders. (This was never done, how- 
ever.) 

2. General Discussion. There was a tendency for members 
of a coalition to split evenly, particularly among the first mem- 
bers of a coalition. Once a nucleus of a coalition had formed, it 
felt some security and tried to exact a larger share from subse- 
quent members of a coalition. The tendency for an even split 
among the first members of a coalition appeared to be due, in 
part, to a feeling that it was more urgent to get a coalition 
formed than to argue much about the exact terms. 

Another feature of the bargaining was a tendency to look upon 
the coalitions with large positive values as the only ones worth 
considering, often overlooking the fact that some players could 
gain in a coalition with a negative value to their mutual benefit 
(this was especially noticeable in Game 2, where B and C were 
always together). 

Coalitions of more than two persons seldom formed except by 
being built up from smaller coalitions. Further coalition form- 
ing was usually also a matter of bargaining between two groups 
rather than more. 
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A result of these tendencies was that the coalition most likely 
to form was the two-person coalition with the largest value, even 
though this coalition did not always represent the greatest net ad- 
vantage for the participants; and this coalition usually split even- 
ly. Thus it frequently happened that the player with apparently 
the second highest initial advantage got the most of the bargain- 
ing. The player with the apparently highest initial advantage 
was most likely to get into a coalition, but he usually did not get 
the larger share of the proceeds of the coalition. 

Initially the players were more inclined to bargain and wait 
or invite competing offers. This remained true to some extent 
in those games where the situation did not appear to be symmet- 
ric. However, later and in those games which were obviously 
symmetric, the basic motive seemed to be a desire to avoid be- 
ing left out of a coalition. Hence there was little bargaining, and 
the tendency was to try to speak as quickly as possible after the 
umpire said "go," and to conclude some sort of deal immediately. 
Even in a game which was strategically equivalent to a symmet- 
ric game, the players did not feel so rushed. A possible reason 
might be that some players felt they were better off than the 
others whether or not they got into coalitions, while others felt 
that they were worse off whether or not they got into coalitions. 
They seemed to pay little attention to the fact that the net gain of 
the coalition was the same to all. 

It is worth pointing out that in an interview after the runs, one 
of the subjects stated that she did appreciate the fact that the fa- 
vored player did not strike the best deal. When she was in the 
position of the favored player she felt that to demand her due 
would seem so unreasonable to the others that she wouldn't get 
into a coalition, and when she was not the favored player, it was 
not to her advantage to say anything about it. 

Personality differences between the players were everywhere 
in evidence. The tendency of a player to get into coalitions 
seemed to have a high correlation with talkativeness. Frequently, 
when a coalition formed, its most aggressive member took charge 
of future bargaining for the coalition. In many cases, aggressive- 
ness played a role even in the first formation of a coalition; and 
who yelled first and loudest after the umpire said "go* made a 
difference in the outcome. 

In the four-person games, it seemed that the geometrical ar- 
rangement of the players around the table had no effect on the 
result; but in the five-person game, and especially in the seven- 
person game, it became"quite important. Thus in the five-person 
game, two players facing each other across the table were quite 
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likely to form a coalition; and in the seven-person game, all co- 
alitions were between adjacent players or groups of players. In 
general as the number of players increased, the atmosphere be- 
came more confused, more hectic, more competitive and less 
pleasant to the subjects. The plays of the seven-person game 
were simply explosions of coalition formation. 

Despite the exhortation contained in the general instructions 
to instill a completely selfish and competitive attitude in the 
players, they frequently took a fairly cooperative attitude. Of 
course, this was quite functional in that it heightened their 
chances of getting into coalitions. Informal agreements were al- 
ways honored. Thus it was frequently understood that two play- 
ers would stick together even though no explicit commitment 
was made. The two-person commitments which were made were 
nearly always agreements to form a coalition with a specified 
split of the profits, unless a third player could be attracted, in 
which case the payoff was not specified. This left open the pos- 
sibility of argument after a third party was attracted, but such 
argument never developed. In fact, the split-the-difference prin- 
ciple was always applied in such cases. 

In the seven-person game the characteristic function made 
coalitions with an even number of players much more desirable 
than those with an odd number of players. With the bargaining 
procedures being used, this made it inevitable that one player 
should lose heavily. There was a feeling among some of the 
players that no individual should lose twice in a row, so perhaps 
a rotation system would have developed if there had been more 
plays. (This was also indicated in subsequent interviews). 
There was, however, no tendency for the winning coalition to 
pay any compensation to the loser. 

The subsequent discussion will be based on the assumption 
that the utility of an outcome to a player, in the sense of von 
Neumann and Morgenstern [4] is directly proportional to the 
number of chips won. Of course, this is far from true. For ex- 
ample, it is hard to prevent the subjects from making a distinc- 
tion between plays in which they gain and plays in which they 
lose. Thus the graph of utility against chips won might look like 
the graph in Fig. 2. 

It is very difficult to judge whether phenomena such as this 
were significant. It was noticeable, however, that some players 
had a convex utility function which expressed itself in a desire 
to randomize, while others did not like to randomize. It the last 
play of Game 3, for example, all four players randomized to see 
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which three-person coalition would form. One player objected 
strenuously to this procedure on the grounds that it would be un- 
fair to the player who was left out. She was forced to comply, 
however, by the threat of a three-person coalition against her. 



Utility 



\ I h 

-3 -2 -1 



t 



1 2 3 



Chips won 



Fig. 2 



3. Compatibility with the Shapley Value [2], The Shapley 
values for different positions are compared with the average ob- 
served outcomes in Fig. 3. More detailed numerical data will 
be found in charts 2-7. There seems to be a reasonably good 

fit between the observed data and the Shapley value, considering 
the small number of plays which are being averaged. There is, 
however, a tendency for the actual outcome to be more extreme 
than the Shapley value. This is especially noticeable in Game 1. 
Apparently this is due to the fact that coalitions with large pay- 
offs were much more likely to form. Thus the players with rela- 
tively high Shapley values were rewarded not only by high payoffs 
when they got into coalitions but also by a strong tendency to be 
in coalitions. In Game 2 and in the five-person game, where 
there were two players whose Shapley values were noticeably 
high, the second highest player did just as well as the first. (See 
the discussion in Section 2.) 

4. Compatibility with Strategic Equivalence [4]. If graphs 
analogous to Fig. 3 are constructed showing the average out- 
comes for these games after transforming them into some nor- 
malized form by strategic equivalence, we might expect that 
strategically equivalent games (1 and 4; 2 and 3) should show 
fairly close agreement; also the symmetries existing in the 
games (A and C are symmetric in Game 1; all players are sym- 
metric in Game 3; B, C, D, E are symmetric in Game 5) might 
be expected to be reflected in such graphs. The actual results 
don't agree very well with these hypotheses. The discussion in 
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COMPATIBILITY OF SHAPLEY VALUE WITH OBSERVED OUTCOMES 

Game 1 
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Section 2 hints at some possible reasons for this discrepancy. 
Thus, the fact that coalitions with high characteristic function 
are most likely to form, the tendency of members of a coalition 
to split evenly, and the non-linearity of the utility function all 
tend to disrupt the concept of strategic equivalence. 

5. Compatibility with von Neumann-Mo rgenstern Solutions [4]. 
It is extremely difficult to tell whether or not the observed re- 
sults corroborate the von Neumann-Mo rgenstern theory [4]. This 
is partly so because it is not quite clear what the theory asserts. 
According to one interpretation a * solution* represents a stable 
social structure of the players. In order to test this theory ade- 
quately, it would probably be necessary to keep repeating a game, 
with a fixed set of players, until there seemed to be some sta- 
bility in the set of outcomes which occurred. One could then see 
to what extent the outcomes of this final set dominate each other 
and to what extent other possible imputations are not dominated 
by them. 

This suggests that we find out to what extent the observed out- 
comes for each game dominate each other. This is not too fair 
a test of the theory, but not much more can be done with the 
available data. The following are all the significant dominations 
(denoted by > ) which occur between the eight observed outcomes 
for each of the four-person games. Dominations which occur be- 
cause of a difference of one chip or so are ignored. 

Game 1. 2, 6, 7 >3; 
3,4,7>1; 

5 >1,2, 3,4,6,7, 8. 

Game 2. 1 >2, 4; 3 >1, 7, 8; 
4>3, 5; 5 >1,7; 

6 >1,7;7>1, 2,4; 
8 >1, 2, 6. 



Game3. 3 > 1, 5, 7, 8; 4 > 3. 

Game 4. 2, 3, 8 >6; 
6 >1; 3 >2. 

Thus for Game 2, so many dominations occur that the set of 
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outcomes cannot be related to any solution. For Game 1 there 
is some hope, since outcomes 2, 4, 6, 7 and 8 do not dominate 
each other. 

For Game 3, the situation is quite satisfactory. Outcomes 1, 
2, 4, 5, 6, 7, and 8 do not dominate each other, and furthermore 
they are clearly related to a familiar solution to this game: 
namely, the solution consisting of (10, 10, 0, -20) and its permu- 
tations together with (0, 0, 0, 0). The other point, 3, also belongs 
to a familiar solution: that consisting of (6 2/3, 6 2/3, 6 2/3, 
-20), (6 2/3, 6 2/3, -6 2/3, -6 2/3) and their permutations. 
These results are more or less explained by the symmetry of 
the game, so it is not clear that they are significant. 

For Game 4, the outcomes 1, 3, 4, 5, 7 and 8 do not dominate 
each other. It would be necessary to try extending this set to a 
solution in order to test the significance of this result. 

In another interpretation of the theory, a solution represents 
a collection of outcomes under consideration by the players at 
some state of the bargaining procedure. Thus, a solution refers 
to a single play of a game rather than a number of plays. It is 
very difficult to tell just what outcomes were under considera- 
tion by the players at any particular time. What was actually ob- 
served was a number of offers, made sequentially. Frequently 
each offer dominated the immediately preceding offer. A more 
thorough study of the data would be necessary, however, actually 
to refute this interpretation. 

An example of a case where such an interpretation did seem 
reasonable is given by the following. In one play of a four- 
person game, a coalition of two was formed, but the agreement 
was made in such a way that the coalition could not be expanded. 
This left the other two players with a pure bargaining situation. 
Actually they chose to split evenly, rather than go through the 
ordeal of bargaining. If they had bargained, however, the set of 
possible outcomes would have formed part of a Shapley quota- 
type solution [3]. 

6. Compatibility with * Reasonable Outcomes*. The following 
definitions, due to J. W. Milnor, establish several bounds for the 
amounts which a player i or a set of players S should get in any 
play of an n-person game. In particular, upper bounds b(i) and 
b(S) are defined by the following: 

b(i) = Max (v(S) - v(S - i)), 
i S 
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b(S) = Max (v(S') - v(S' - S)). 
S 1 D S 

A theoretical plausibility consideration for these definitions is 
given by the fact that one would not expect a player i to exact 
more from a coalition S of which he is part than his presence 
contributes to it, i.e., v(S) - v(S - i) should be as much as he 
could reasonably expect as his share. The set B of imputations 

oe = {c^pi $b(i)J is a rather large set - for example, in the 

normalized zero-sum essential three -per son game (see [4]), B 
equals the set of all imputations. It can be proved that B always 
contains the Shapley value (see [2]), and also every solution in 
the sense of von Neumann and Morgenstern (see [4]). Section 7 
contains comparisons between b(i), and the actual outcomes. It 
appears that b(i) is usually compatible with the outcomes, in 
particular in the case of the four-person games (a deviation in 
one of the cases was subsequently related to the statement made 
by one of the subjects after the experiment was over that he had 
made a mistake in reasoning),, The agreement is less favorable 
in the case of the five-person game - some players getting more 
than their "maximum share* b(i) - a circumstance which may be 
related to the fact that the players rushed into coalitions, split- 
ting the payoff evenly, without really studying the strategic pos- 
sibilities. 

A lower bound, ft(S), was also defined for the payoff to each 
coalition. Its definition is the following: 



Jl(S) = Min(v(Si) + v(S -Si)); L= all imputations a= {aM L a 
SicS \iS 

It is not hard to see that Jt(S) = v(I) - b(I - S) in the constant sum 
case - which makes Jt(S) a plausible candidate for a lower bound. 
It is possible to show that under rather strong assumptions (viz., 
bargaining will always result in the formation of just two oppos- 
ing coalitions; in order for a coalition T to form it must distrib- 
ute its payoff v(T) among its members in a way which is "stable" 
in the sense that every subset Tj of T is given at least v(Tj) ) 
the outcome of a play must lie in L; a v. Neumann-Morgenstern 
solution [4] may fail to be contained in L, nor need L contain 
the Shapley value [2] of the game. The sixth game (a seven- 
person game) was an example of a game for which there exists 
a v. Neumann-Morgenstern solution giving to a set S less than 
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1(S) - the quota solutions of this game contain outcomes such as 
(-40,40,0,0,0,0,0) giving some five-person sets -40 whereas 
A(S) = -20 for all five-person sets. It turned out that the actual 
minimum outcomes for five-person sets S were in 11 instances 
substantially higher and in one instance equal to j(S). An out- 
come (payoff) f ailing within the limits defined above is called a 
* reasonable outcome*. 

7. Numerical Data. The actual outcomes from these games 
are given in the following tables. Expected outcomes are given 
in the few cases where the players randomized. Game 1 is a 
constant-sum game with v(I) = 80. The others are all zero-sum 
(the characteristic functions are given in Section 1). 

The commitments listed in the last column include only for- 
mal agreements processed through the umpire. Many 
information agreements were made and kept. The numbers 
b(S) and the Shapley value are to be construed in the light of 
Sections 3 and 6 above. 



H. EXPERIMENTAL WORK WITH NEGOTIATION MODELS 

A negotiation model is a non-cooperative game [l] which is 
based on a strictly formalized negotiation procedure applied to 
a cooperative game. In principle such a model can be studied 
and analyzed in terms of equilibrium point theory [3] as a non- 
cooperative game, and the kind of solution thus obtained may 
then be considered as a possible solution of the underlying coop- 
erative game. One of the aims of the experiments related in this 
paper is to gain information on the workability of the model. 

Actually, we use the phrase 'negotiation model* somewhat 
more broadly than is implied above. The model, as a non- 
cooperative game, may not have anything like a really satisfac- 
tory non-cooperative solution. It may only be an intermediate 
model which would have to be further modified (perhaps by put- 
ting in preceding commitment moves) before anything very satis- 
factory for a theoretical solution would be obtained. Some such 
modification may show up if the game is actually played by the 
subjects. Another possible aim of experimentation may be to ob- 
serve the effects of repeated playing of the same game by the 
same set of players. This may make the (negotiation) game 
more cooperative because information can then be transmitted 
and inferred by the history of the previous plays; precedents can 
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be established; players may convince others that they will con- 
tinue to behave in certain ways in future games (for instance, by 
a persistent use of a certain strategy). 

We performed experiments on a three-person model and on a 
four-person model. In both cases the players 1 communication 
with each other was strictly formalized and was effected through 
the umpire. 

In the three-person game the players were blindfolded and in- 
dicated their moves to the umpire by means of hand signals. In 
the four-person games each player sat where he was unable to 
see any of the other players (of whose identities he was unaware) 
and made his moves by writing them on paper. 

The rules of the three-person game were as follows: 

First move: Player A may either (1) wait or (2) make an of- 
fer to form a coalition with either player B or C; this offer is to 
specify the share d^ of the proceeds of the proposed coalition 
which A intends to obtain for himself (d^, etc., must be integers). 
Players B and C make similar first moves- -and all three make 
their moves simultaneously and independently. 

If two players, say A and B, have made offers to each other, 
and if d^ + dg <^ 15, the game is over and the payoffs are the fol- 
lowing: A gets d^, B gets d B , C gets -(&&+ <%) H dA + de > 15, 
all three get zero. 

If a coalition was formed, the play is over, and all three play- 
ers get zero. 

H, say, A has 'waited* at his first move and was offered a co- 
alition by another player, he has a second move where he may 
either choose to accept or to reject. In either case, the play is 
over. The payoffs in the first case are (if A accepts B's offer): 
A gets 15 - da, B gets dg, C gets -15. In the second case all 
three players get zero. 

This game has been treated theoretically. Let us change the 
normalization, so that a coalition gets +1, the outsider 0. A de- 
mand can be any real number between and 1, and the payoff is 
1/3, 1/3, 1/3 when no coalition forms. K the above-described 
game were changed by disallowing demands greater than 1/2, the 
new game would have a symmetric equilibrium point where each 
player offers with probability .544, with probability .456, and dis- 
tributes his demands between .415 and. 500, the average demand 
being .455. 

The observed behavior of the players corresponds fairly 
closely to this strategy; 85 per cent of the demands were within 
the theoretical range. Waiting was somewhat neglected, occur- 
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ring only 33 per cent instead of the theoretical 41 per cent. The 
theoretical strategies are not quite at equilibrium in the game 
with unrestricted demands, for the demand of 2/3 - would be a 
strategy with an expected payoff of about .34, a little more than 
1/3, which is the expectation for demands in the lower range. 
Of course, such high demands would not be advantageous against 
the actual play pattern in the experiment; waiting was not fre- 
quent enough. 

The true non-pathological equilibrium point for this game 
seems to be the same as for the game in which only coalitions 
formed by one player offering and the other player waiting and 
accepting are allowed. Here the offering probability is only 42 
per cent, and demands start at .53 and run up to 2/3, with most 
of the weight of the distribution at the low end. This distribution 
is completely non-overlapping with the distribution described 
before. 

The four-person games were derived by formalizing the nego- 
tiation from two of the cooperative games which were investigat- 
ed in the part of our experiment devoted to free bargaining 
(namely games 3 and 4). Again all players moved simultaneous- 
ly and independently at each stage, all players being informed, 
after each move, of the choices made by the others. There were 
two kinds of moves: one in which offers may be made, and one 
in which offers made in the move immediately preceding may be 
accepted by a player who had passed "waited* during that pre- 
ceding move. A more detailed description can perhaps best be 
gleaned from the presentation of a sample play of the game. 

Example 
(This is based on Game 4) 



Player 


A 


B 


C 


D 


Proposal 


ABC 


AB 


AC 


DC 




6 


6 









This is the first move. Here each player has 
made a proposal. The number under the letter 
of a player indicates his demand, the set of let- 
ters the proposed coalition. No potential coali- 
tion has the property that all its members have 
either named it or waited. If all had named it 
and had made mutually compatible demands it 
would have been formed at this first move. 
Once formed, coalitions are permanent unless 
enlarged. 
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Player 


A 


B 


C 


D 


Proposal 


Wait 


ADD 
6 


ABC 
6 


DC 




This is the second move. Since no coalition can 
arise from the first move, the second move pro- 
ceeds again afresh, except for the information 
from Move 1. Still no coalition can form. 



Player 


A 


B 


C 


D 


Proposal 


Wait 


ABD 
6 


Wait 


Wait 



This third move makes it possible for ABD to 
form and an acceptance move is inserted in the 
game. In the acceptance move each of the play- 
ers involved, A and D in this case, can reject or 
accept the proposed coalition. Acceptance must 
be associated with a demand for a certain pay- 
off, and is conditional on obtaining the amount 
demanded. 



Acceptance Move; 



Players 


A 


B 


C 


D 


Proposal 


ABD 
6 


No 
Move 


No 
Move 


ABD 

7 



Now A and D have made their acceptance pro- 
posals. The coalition ABD is successfully 
formed because 20 points are available to ABD 
and only 6+6+7=19 have been spoken for. The 
play is over now because the coalition formation 
is complete. (Since we were working with zero- 
sum games we excluded the coalition of all four 
players.) A, B, and D are paid their demands, 
6, 6 f and 7 respectively. C receives -19 to make 
the game zero-sum. 
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There are not many data from this part of the experiment, 
but it showed that the model was workable. The number of 
moves per game decreased as the players learned better strate- 
gies than they began with. Afterwards, the subjects reported 
that they enjoyed this part of the experiment more than any other 
because it didn't involve the exhausting face-to-face haggling. 

Thus further experiment with negotiation models may well be 
profitable. 



m. A STOOGE GAME 

The "stooge game* was a zero-sum three-person game with 
(integral valued) side payments. Each play proceeded as follows 
At the end of four minutes, or when the players reported agree- 
ment to the observer (whichever was the earlier), the following 
payoffs were made: if no coalition was formed each player got 
0; if a coalition was formed (the formation of a coalition implied 
an agreement on how to split its proceeds), it received 10 chips 
from the odd player. A coalition was permitted to make a pay- 
ment to the odd player (this, however, was never done). There 
were three runs of five plays each. The stooge had instructions 
to offer a 7 - 3 split to one or the other of the two subjects in the 
majority of cases, three to himself and seven to the other player. 

During the first run, the stooge had difficulties establishing 
himself, since the game proceeded very rapidly and was over in 
a matter of minutes or even less. During the second run, the 
stooge did manage every time to get a 7 - 3 split and during the 
third run, the subjects "caught on* and the 7-3 split offered by 
the stooge occurred only twice (once it occurred between the 
two subjects),, 

In spite of occasional statements made by the subjects that 
they wanted to play the game non-competitively, the stooge's be- 
havior caused a very competitive atmosphere and there was 
sharp and intense bargaining all the time. Two tendencies that 
could be observed were that first coalitions often persisted, and 
that tentative coalitions formed almost at once. In general, the 
plays proceeded very rapidly. 
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One of the purposes of this experiment was to see if a com- 
petitive atmosphere could be induced by a stooge- -and, as ex- 
pected, this was possible. Another purpose was to see if von 
Neumann-Mo rgenstern's discriminatory solution of a three - 
person game (see [4]) could be established. This definitely was 
not achieved in this experiment. 



IV. A THREE -PERSON COOPERATIVE GAME 
WITH NO SIDE PAYMENTS 

In an effort to get some ideas about the proper theory for 
games without side payments, the following three-person game 
was played. Each player was given two playing cards, each rep- 
resenting one of his opponents. A player moved by placing one 
of his cards face down on the table, thus a voting " for one of his 
opponents. If some player got two votes, then he was awarded 
40 chips and each of the other two was penalized 20 chips. If 
each player got only one vote, then there was no transfer of 
chips. The actual play was preceded by a bargaining period dur- 
ing which the players could make any deals or commitments they 
wished subject only to the following two restrictions: (1) there 
must be no side payments involved, and (2) the deal or commit- 
ment must refer only to the individual play of the game under 
consideration. There were three runs of four, each run involv- 
ing a new set of players. 

The results of this experiment were rather negative. The 
players were simply unwilling to play competitively. In two of 
the three runs the players equalized the expectations either by 
playing randomly or by rotating the winning player over three 
plays, violating prescription (2). In the other run some of the 
plays were still random although there was a tentative deal of 
the form "if you vote for me I will vote for you," and a threat of 
the form "unless you vote for me I will vote for him/ 1 It is of 
course extremely difficult to bargain effectively in this game, 
but it had been hoped that some more positive result would be 
attained. 

The following suggestions may be made in case experiments 
of this type are carried on in the future. It would be better to 
play an unsymmetrical game so that there would be no obviously 
fair method of arbitrating the game and avoiding competition. 
The same set of players should not be together repeatedly since 
there is too much of a tendency to regard a run of plays as a 
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single play of a more complicated game. A competitive attitude 
should be fostered by proper and more thorough indoctrination or 
otherwise; possibly the proper use of stooges may induce a more 
competitive mode of behavior (see "Stooge Game*). 
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APPENDIX B 
Michigan Project - Summer Seminar 

List of Papers Presented 



Norman Dalkey - Cooperative Aggregation 

C. H. Coombs, H. Raiffa, and R. M. Thrall - Some Views on 

Mathematical Models and Measurement Theory 
Clyde H. Coombs - The Measurement of Social Utility 
Douglas Lawrence - The Determination of Alternatives in a 

Choice Situation 
Leon Festinger - Some Remarks on the Relationship between 

Mathematics and Social Psychology 

Clifford Hildreth - Alternative Conditions for Social Order ings 
Allen Newell - Experimental Study of Organizations 
Robert J. Wolfson - Notes on Methodology 
John Kennedy - Experiments in Social Sciences 
Robert R. Bush - The Acquisition of Preferences 
G. L. Thompson - A Simplified Bridge Game 
Roy Radner - An Example of Individual Decision Making Under 

Uncertainty 

Merrill M. Flood - Report on Some Experimental Games 
Abraham Kaplan - Philosophical Theory of Value 
John Milnor - Games Against Nature 
Evar Nering - Solutions of n- Person Games 
John Nash - N- Person Games and the Need for a Better Theo- 
retical Basis 
W. K. Estes - Theory and Experiments Concerning Individual 

Behavior in Uncertain Situations 
Paul Kecskemeti - Science and Policy 
L. S. Shapley - Stable Sets in the Theory of Games 
Herbert A. Simon - A Model for Group Interaction 
Oskar Morgenstern - Experiment and Computation in Economics 
Jacob Marschak - Some Remarks on Teams and Organizations 
James T. Culbertson - Neuroeconomy in Hypothetical Robots 
Robert Chapman - Experimental Study of Organizations 
Gerhard Kalisch - Remarks on n- Person Games 
John von Neumann - Remarks on Chess-playing Automata 
Robert R. Bush and G. L. Thompson - General Stochastic Model 
Leo Goodman - Methods of Amalgamation 
Roy Radner and Abraham Kaplan - Questionnaire Approach to # 

Probability 
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W. K. Estes, R. J. Wolfson, and Roy Radner - Group and Indi- 
vidual Prediction 

G. L. Thompson - Trapping Theorems 

R. R. Bush - Estimation of Model Parameters 

Festinger, Lawrence, Kalisch, and Nering - An Experiment on 
Coalitions 

Kalisch, Marschak, Wagner, Fulkerson, and Estes - Experiments 
on Bargaining and Game Theory 

Sam Karlin - Mathematical Aspects of the Learning Model 



